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RATING SCALES, DISCRIMINABILITY, AND 
INFORMATION TRANSMISSION’ 


W. R. GARNER 


Johns Hopkins University 


Rating scales have been so widely 
used in psychology, both as research 
tools and in practical applications, that 
the determination of the optimum num- 
ber of rating categories has been an im- 
portant research problem in itself. The 
usual attack on this problem has been 
to determine the effect of number of 
rating categories on interrater reliabil- 
ity, and research on these lines has been 
well summarized by Guilford (1954, p. 
289 ff.). 

The purpose of a rating scale is to 
allow a rater or group of raters to dem- 
onstrate perceptual discrimination of a 
set of stimuli or objects to be rated. In 
other words, our concern is not with 
the rating scale per se, but rather with 
the extent to which different rating 
scales allow us to measure discrimina- 
tion between the stimuli rated. And 
the basic question about number of rat- 
ing categories is simply whether for any 
given rating problem there is an opti- 


1 This research was done in part under 
Contract N5-ori-166, Task Order 1, and in 
part under Contract Nonr-248(55), between 
the Office of Naval Research and Johns 
Hopkins University. This is Report No. 6 
under the latter contract. Reproduction in 
whole or in part is permitted for any purpose 
of the United States Government 
tion is due to Robert Austin, Allen Madans, 
and Natalia Potanin for aid in various cal- 
culations 
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mum number of rating categories, or at 
least a number of rating categories be- 
yond which there is no further im- 
provement in discrimination of the 
rated objects. 

Techniques directly concerned with 
discriminability of stimuli have rarely 
been used for this problem, and yet 
since the problem basically is one of 
stimulus discrimination, it seems most 
appropriate that they should be. 

Two techniques which have been suc- 
cessful in dealing with related problems 
are the use of information transmission 
measures and the use of discriminabil- 
ity scaling procedures. Garner and 
Hake (1951) described how informa- 
tion transmission measures can be used 
with the absolute judgment problem to 
determine the effective discrimination 
between the stimuli of a set. While 
there are ramifications of the absolute 
judgment procedure which are differ- 
ent from those of the rating scale situa- 
tion, the problems are sufficiently alike 
that we can expect information trans- 
mission measures to be useful. Bendig 
and Hughes (1953) and Bendig (1954) 
have in fact used the information par- 
adigm for the rating scale problem, and 
came to the conclusion that discrimina- 
tion increases with an increase in num- 
ber of rating categories up to at least 
11 categories. 
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In the same paper, Garner and Hake 
(1951) described a discriminability 
scaling procedure and pointed out the 
relatedness of this procedure to the use 
of information transmission measures. 
To our knowledge, however, discrimi- 
nability scaling procedures have not 
been used in determining the optimum 
number of rating categories. 

The purpose of the present paper is 
to show how both information measures 
and discriminability scaling procedures 
can be used to aid in the solution of the 
rating scale problem. In addition, it 
will be shown that these two proce- 
dures are quite closely related, and that 
measures from one can be used to esti- 
mate the other. An illustrative set of 
data will help clarify the nature of these 
relations. 


THE DaTA 


The data used in these experiments 
were ratings of the legibility of hand- 
writing samples. We obtained 20 dif- 
ferent samples of handwriting of “The 
quick brown fox jumped over the lazy 
dogs,” chosen to give us a reasonable 
spread of legibilities. The samples were 
arranged on a single mimeographed 
sheet, one toa line. A total of 50 raters 
then rated each sample on a rating scale 
with 4, 6, 8, 10, 12, 16, and 20 cat- 
egories. Each rater rated all samples 
with each rating scale, using one rating 
scale until all samples were rated on 
that scale. Different raters used the 
rating scales in different counterbal- 
anced orders, and the samples of hand- 
writing were randomized differently on 


each sheet to minimize any time errors. 
The raters were randomly selected 
from available graduate and undergrad- 


uate students. Since each of 50 raters 
rated each of 20 samples of handwriting 
on each of seven different rating scales, 
we had a total 
basic data. 


if 7,000 ratings as our 
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INFORMATION TRANSMISSION 


The first analysis was to determine 
the information transmission, U(R:S), 
between stimuli, or samples, and re- 
sponses for each of the rating scales. 
(While we are using the term informa- 
tion transmission, we will use the more 
general notation of Garner and McGill, 
1956, in which U(R:S) is a contingent 
uncertainty between responses and stim- 
uli.) All 50 ratings for each sample 
were pooled for this analysis. Since 
U(R:S) is limited by the number of 
response categories, the various rating 
allowed maximum information 
transmissions from 2.00 bits for four 
categories to 4.32 bits for 20 categories. 

The open squares of Figure 1 are 
the information transmission measures 
computed from the basic data, and these 
data show a consistent rise in informa- 
tion transmission as the number of rat- 
ing categories is increased. This result 
is in agreement with the results of 
Bendig and Hughes (1953) and Ben- 
dig (1954). These calculations, how- 
ever, are subject to a bias in the infor- 
mation measure which changes progres- 
sively with number of rating categories. 


scales 


1.00 








INFORMATION 
TRANSMISSION, U(R:S) 











NUMBER OF RATING CATEGORIES 


Fic. 1. Relation between number of rat- 
ing categories and information transmission. 
(The open squares represent the direct meas- 
ures of information transmission. The open 
circles represent the same data corrected for 
bias by the Miller-Madow correction. The 
two lines are least-squares fits for these two 
sets of measures. The closed circles repre- 
sent estimates of information transmission 
obtained from discriminability functions.) 
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Each matrix has the same total number 
of ratings, but when.the number of rat- 
ing categories is increased, these rat- 
ings are distributed more thinly over 
the larger number of cells in the matrix. 
Miller (1955) has described a correc- 
tion for this bias which is known as the 
Miller-Madow correction, and had him- 
self suggested (Quastler, 1955, fn., p. 
16) that Bendig’s results would have 
been different if the correction for bias 
had been applied. 

The correction for bias of informa- 
tion transmission is always in the direc- 
tion of decreasing the estimate of trans- 
mission, and the fewer the number of 
cases per cell, the greater the correc- 
tion ; i.e., the greater does the directly 
calculated information transmission ex- 
ceed the true value. When the Miller- 
Madow correction is applied, the result 
is that shown by the open circles of 
Figure 1. This result, which presents 


more nearly the true picture, shows 
considerably less effect of number of 


rating categories than does the result 
using raw measures. The correction 
flattens out the function. However, it 
is clear that there still is some effect 
of number of rating categories, and that 
an increase in the number of categories 
increases information transmission even 
up to 20 categories. 

The two lines drawn in Figure 1 are 
linear functions, but they should be 
considered as entirely empirical de- 
scription, since logically the functions 
cannot be linear much beyond the range 
of conditions investigated. For ex- 
ample, if only one rating category is 
used, there can be no information trans- 
mission, but the functions drawn do 
not extrapolate to zero with one rating 
category: Somewhere below four cat- 
egories they must slope rapidly down- 
wards. 

On the other hand, there cannot be 
a meaningful increase in information 
transmission if more than 20 rating 


categories are used. With 20 or more 
rating categories, each rater can put 
each stimulus in a different category, 
and the rater can evidence no more 
discrimination than to rate each stim- 
ulus differently. And clearly the de- 
gree of discrimination by the group is 
limited by the discrimination of each 
rater. Actually the directly computed 
information transmission can increase 
with an increase in number of rating 
categories until each rater-sample com- 
bination produces a different rating. 
The function corrected for bias, how- 
ever, would not increase concomitantly, 
since the bias correction would increase 
at approximately the same rate as the 
increase in number of rating categories. 
In the limit, the information transmis- 
sion corrected for bias becomes degen- 
erate, since with a large enough number 
of categories and a small enough num- 
ber of ratings, it is impossible to dem- 
onstrate that the measured information 
transmission is different from the value 
expected on the assumption of no true 
contingency. 


DISCRIMINABILITY FUNCTIONS 


The second method of determining 
the amount of discrimination in the 
ratings is to construct discriminability 
scales for the set of stimuli. ' The dis- 
persion of the scale values is then a 
measure of the relative discrimination 
between the various stimuli. We have 
carried out the necessary scaling pro- 
cedures for these data in two different 
ways. 


Pooled Data 


With the fifst method, the data 
were kept in the pooled form in which 
they were used to compute information 
transmission. The S-R matrix of fre- 
quencies was converted into a set of 
equivalent pair comparisons with the 
technique described by Guilford (1938). 
The essence of this procedure is that 
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the distribution of responses for each 
stimulus is compared category by cat- 
egory with the distributions for each of 
the other stimuli to obtain a measure of 
the equivalent number of times that 
each stimulus would have been rated 
greater than each other stimulus, as- 
suming the ratings to be uncorrelated. 
From the matrix of equivalent pair 
comparisons, a discriminability scale 
was then constructed following the 
usual procedure for pair comparisons. 
One such discriminability scale was 
constructed for each rating scale. 


Individual Data 


With the second method, the ratings 
were changed to equivalent pair com- 
parisons one rater at a time. Thus 
each rater’s ratings of the 20 stimuli 
on one rating scale were changed into 
equivalent pair judgments in much the 
same way as Guilford (1954, p. 183 
ff.) describes for converting ranks. 


Ties were used where necessary. These 
individual pair comparisons were then 
pooled for all 50 raters, and a dis- 
criminability scale constructed—again 


one for each rating scale. We thus 
have 14 discriminability scales, one for 
each of seven rating scales with each 
method. 

The purpose in carrying out the two 
kinds of analysis was to determine 
whether the pooling of the data for the 
informational analysis had had any sys- 
tematic effect on the function relating 
information transmission to number of 
rating categories. With a complete 
multivariate analysis of data such as 
these, we can determine separately the 
extent to which responses can be pre- 
dicted from samples, from raters, and 
from the interaction between samples 
and raters. With only one rating per 
sample per rater, however, we have no 
choice but to pool the data, and are in 
effect accepting the interaction term as 
an error term rather than as a predict- 
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ability term. We are in effect also 
accepting the contingent uncertainty be- 
tween responses and raters as an error 
term. This last factor, which indicates 
the extent to which different raters 
use different responses on the average, 
could change with an increase in num- 
ber of rating categories. For example, 
if all raters used rankings when 20 cat- 
egories were allowed, then there could 
be no difference in use of responses by 
different raters, but such differences 
could exist with smaller members of 
categories. Thus, although it is not 
feasible to measure such a term di- 
rectly, it is possible that the effect 
of this factor systematically decreased 
with the increase in number of rating 
categories, with the resultant increase 
in the information transmission which 
was obtained. 

The first discriminability scale anal- 
ysis used the pooled ratings to convert 
to equivalent pair comparisons. Thus 
any systematic differences between rat- 
ers in ratings would still affect the final 
scale values. With the second scaling 
procedure, any such differences . were 
removed by changing the data to equiv- 
alent pair comparisons before pooling. 


Correlations of the Scales 


The various scales produced by the 
two different methods are essentially 
identical. When the scale values are 
correlated between the two data proc- 
essing methods for each of the seven 
rating scales, the seven correlation co- 
efficients are all above .99. In addition, 
the correlations between scales for the 
different rating methods are quite high, 
as shown in Table 1. We have shown 
in this table only the correlations for 
the pooled data. Since all of the corre- 
lations between the two scaling methods 
are so high, a table of correlations using 
the individual data is practically iden- 
tical to Table 1. 

While the correlations between dis- 
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rABLE-1 
CORRELATIONS BETWEEN DISCRIMINABILITY 
SCALE VALUES FOR DIFFERENT NUMBERS 
OF RATING CATEGORIES 


91 

96 | .95 

97 | .95 96 

96 92 97 .97 

94 90 95 96 .97 


Note 
relations. 


Pooled data were used for these cor- 


criminability scale values obtained from 
different numbers of rating categories 
are all quite high, it is clear that rating 
scales similiar in number of categories 
correlate more highly than do widely 
different scales. Thus the 4 scale and 
the 20 scale have the smallest correla- 
tion, while the 16 scale and the 20 scale 


have the highest correlation, along with 
some other close pairs. 


Discriminability 
correlations between 
discriminability scale values show only 
that the various scales produced are 
highly predictable, one from the other. 
They do not show that the total amount 
of discrimination is the same. In order 
to determine the total amount of dis- 
crimination, the simplest procedure is 
to compute the standard deviations of 
the scale values for each of the scales 
The scale whose stimulus values are 
most spread apart is the scale with the 
greatest interstimulus discrimination. 
Consequently, standard deviations have 
been computed for each of the 14 dis- 
criminability scales, and these values 
are plotted in Figure 2. 
These data, like the 
transmission data, show an increase in 
discrimination with an increased num- 
ber of categories on the rating scale. 


These data on 


information 


And while the individual data show 
slightly greater discrimination than do 
the pooled data, the difference is very 
small—mean standard deviation for the 
pooled data is 0.881 and for the indi- 
vidual data it is 0.897. 

What is more important, however, is 
that there is no evidence that the in- 
crease in discrimination with the in- 
crease in number of rating categories 
is in any way due to the pooling of the 
data. Both sets of points show a def- 
inite increase, and only for the 4 cat- 
egory scale is there any appreciable 
difference between the two standard 
deviations. 

Once again the line drawn is a least- 
squares solution which can be consid- 
ered only an empirical description. If 
one rating category is used, the stand- 
ard deviation must be zerb, and the 
curve shown does not extrapolate to 
zero at that point. Likewise, with a 
larger number of rating categories the 
scale values could not increase sub- 
stantially, since no greater precision in 
equivalent pair comparisons could be 
obtained if there are more rating cat- 
egories than there are stimuli. In other 
words, since the discriminabijity scale 
values are obtained from ¢quivalent 
pair comparisons, the larger grange of 








VALUES 


SCALE 


D. OF 








S 





+ 8 12 16 20 24 
NUMBER OF RATING CATEGORIES 


Fic. 2. Relation between number of rating 
categories and standard deviation of the dis- 
criminability scale values. (The open circles 
are for the data processed as a group; the 
closed circles for the data tabulated by in- 
dividuals. The line is a least-squares fit.) 
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numbers in no way affects the standard 
deviations of the scale values. 


ESTIMATES OF INFORMATION 
TRANSMISSION 


We have shown that two alternative 
methods of determining the amount of 
discrimination in a set of stimuli evi- 
denced by a rating procedure lead to 
the same conclusion, namely, that dis- 
crimination increases with an increase 
in number of rating categories, up to 
20 categories in our case. It seems 
very reasonable that these two meas- 
ures should, therefore, be related. It is 
the purpose of this section to show that 
they are related and that one can be 
quite exactly estimated from the other. 

The actual equation for estimating 
transmitted information from the stand- 
ard deviation or variance of the dis- 
criminability scale values is 


estU(R:S) =4 log, [V(S) +1] [1] 


where estU/(R:S) is the estimated in- 
formation transmission and V(S) is 
the variance of the discriminability scale 
values of the stimuli. The S notation 
emphasizes the fact that these are mean 
scale values. 

For the pooled data shown in Figure 
2, an estimate of the information trans- 
mission has been computed for each 
number of rating categories, and these 
values are shown as the closed circles 
of Figure 1. It is obvious that Equa- 
tion 1 provides a very accurate estimate 
of the unbiased information transmis- 
sion, and that the two measures are 
highly related. 

The rationale of Equation 1 can be 
understood from two related points of 
view. First, we can conceptualize the 
discriminability scale values as a set of 
values for a set of stimuli, and this 
same set of values as the mean numer 
ical responses which would be obtained 
if both the stimuli and the responses 
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were metricized in the unit normal sys- 
tem used with discriminability scales. 
In other words, we really have an S-R 
matrix, but one in which these new 
scale values, rather than the respective 
original designations, are used for both 
stimuli and responses. Now with such 
a matrix both the stimulus scale and the 
response scale are continuous, but we 
could compute an information transmis- 
sion figure from the matrix by catego- 
rizing the scales. It is not necessary, 
however, to set up such a categorized 
matrix, since uncertainties can be esti- 
mated from variances (Garner & Mc- 
Gill, 1956) and we can easily determine 
the appropriate variances with no need 
to categorize. 

Information transmission can be com- 
puted from the following equation : 


U(R:S) = U(R) —Us(R), [2] 


where U(R) is the response informa- 
tion or uncertainty, and U,(R) is the 
response information conditional on the 
stimulus, or ‘response equivocation, as 
it is sometimes called. 


estU(R:S) = 4 log, [V(R)] 
—tlog, [Vs(R)] [3] 


In this equation V(R) is the variance 
of the response continuum, and V.(R) 
is the average variance of the responses 
taking one stimulus at a time, i.e., the 
within column variance. The two terms 
on the right side of Equation 3 are the 
estimates of the equivalent uncertainties 
of Equation 2. (Actually, each of these 
terms in Equation 3 should have an 
associated term which reflects the width 
of the category interval, but since these 
terms are the same in both cases, they 
cancel each other—see Garner & Mc- 
Gill, 1956, for further explanation. ) 

Now we know that the value of 
V.(R) is 1.0, since it is so by definition 
im a unit normalized system; therefore 
the logarithm of it is zero. 
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On the other hand, 
V(R) =V(R)+1=V(S) +1. [4] 


V(S) is the variance of the actual stim- 
ulus scale values, but since we are as- 
suming a one-to-one relation between 
the responses and the stimuli (except 
for error variation), the mean response 
for each stimulus is the same as the 
stimulus value itself. Thus the var- 
iance of the means of the responses is 
the same as the variance of the stimulus 
scale values. But the total variance of 
the responses, V(R), will be the var- 
iance of the means plus the variance 
within groups (the error variance in 
analysis of variance terms). Since the 
error variance is 1.00, the total variance 
of the responses in this idealized matrix 
will be V(S) + 1. 

Thus the estimated second term on 
the right of Equation 2 is zero, and the 
first term on the right is directly esti- 
mated as shown in Equation 1. 

The other point of view from which 
to understand Equation 1 is by means 
of correlational analysis. Attneave 
(1959) has pointed out the general 
relation that 


estU(R:S) _ 4 log, (1 — r?) [5] 


Now we can define 


r?=V(S)/(V(S) +1] 


[6] 


This correlation coefficient is stated in 
the usual form as the ratio of the pre- 
dictable variance to the total variance 
of a set of scores, and as shown above, 
the total variance of the responses is 
V(S) + 1, and the predictable variance 
of the responses is V(S). This corre- 
lation coefficient is quite similar to one 
proposed by Gulliksen and Tukey 
(1958, p. 108), except that their co- 
efficient is a reliability of mean scale 
values, while the coefficient proposed 
here is essentially a reliability of indi- 
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vidual scores, and is a straightforward 
variance ratio. 

From the relation of Equation 6 
(1-+r,?) is 1/[V(S) +1], and from 
Equation 5, Equation 1 is directly de- 
rivable. 


Characteristics of the Estimate 


This estimating procedure is quite 
accurate for these data. It should be 
puinted out, however, that the accuracy 
of the estimating procedure depends on 
the normality of the distribution of dis- 
criminability scale values, since the esti- 
mating equations work accurately, as 
both Garner and McGill (1956) and 
Attneave (1959) have pointed out, only 
when both variances are approximately 
of the same distribution. This require- 
ment means that the normal distribution 
is necessary for the mean scale values 
since the error variance is already 
forced not only into 1 unit system but 
also into a normal system. If the stim- 
ulus scale values are off to the extent 
of providing a rectangular distribution, 
the error in estimating could be as 
much as 0.25 bit. The estimating pro- 
cedure can be used quite generally, 
however, if a correction factor is esti- 
mated for the shape of the distribution. 

It is interesting to note that these 
estimates of information transmission 
agree very well with the information 
transmission measures corrected for 
bias, rather than with the uncorrected 
measures, even though no unbiasing 
procedure is inherent in the calcula- 
tions. The reason for this fact is that 
the discriminability scales are based on 
essentially all of the available data, and 
thus are not themselves subject to the 
same sorts of bias which operate with 
the directly computed measures of in- 
formation transmission. In this re- 
spect, the estimating procedure accom- 
plishes much the same effect in min- 
imizing the problem of bias as do 
the procedures suggested by Attneave 
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(1955) for pooling data in various 
ways which are appropriate to specific 
situations. 


THE OptimuM NUMBER OF 
CATEGORIES 


The primary purpose of this paper 
is to demonstrate that information meas- 
ures and discriminability scales are use- 
ful in determining the optimum number 
of rating categories, and furthermore to 
show that these two measures are ra- 
tionally related. It is nevertheless ap- 
propriate to discuss the nature of the 
factors which affect the optimum num- 
ber of rating categories. 

The optimum number of rating cat- 
egories, or at least the number beyond 
which there will be no further improve- 
ment in discrimination, is clearly a 
function of the amount of discriminabil- 
ity inherent in the stimuli being rated. 
Thus there can be no single number of 
rating categories appropriate to all rat- 
ing situations. At the limits of dis- 
criminability, this fact is obvious. For 
example, if ratings are made of a set of 


objects between which there are no- 


perceptible differences, then one rating 
category will do as well as two or more. 
If there is no perceptual discrimination, 
there is no need of a discriminating 
rating scale. 

On the other hand, suppose that we 
have twenty different objects all of 
which are clearly discriminable. Then 
we must have as many rating categories 
as there are objects in order for raters 
to demonstrate their ability to discrim- 
inate, and ranking will be the most 
effective procedure. In information 
terms, we cannot allow the response 
information to limit the measurable in- 
formation transmission. 

The limiting cases are manifest; the 
intermediate cases are much more am- 
biguous. Bendig and Hughes (1953) 
had their subjects rate 12 countries 
for familiarity, and when the Miller- 
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Madow correction for bias is applied 
to their data, there is no improvement in 
information transmission beyond three 
rating categories, with which the infor- 
mation transmission obtained is approx- 
imately 0.25 bit. When Bendig (1954) 
had subjects rate preference for 20 
foods, increased information transmis- 
sion (again corrected for bias) was ob- 
tained up to at least nine categories. 
with which 0.32 bit of transmitted 
information was obtained. In the 
present experiment, increased discrim- 
ination is obtained with up to 20 cat- 
egories, with which approximately 0.50 
bit of transmitted information was 
measured. 

What seems to be clear from these 
few cases is that the number of useful 
categories increases with an increase in 
the amount of information transmis- 
sion, and that the number of categories 
needed is far greater than the infor- 
mation transmission figure seems to 
indicate. 

This fact that the number of useful 
rating categories is far greater than the 
information transmission seems to indi- 
cate is at first glance very surprising, 
particularly since in the absolute judg- 
ment situation, which is so similar, quite 
another conclusion has been drawn. 
When information transmission has 
been measured in the absolute judgment 
context, the obtained measure has been 
used to determine the minimum number 
of categories with which the given in- 
formation transmission can be obtained, 
and many experiments have demon- 
strated the validity of this interpreta- 
tion of information measures. In other 
words, it has been demonstrated that 
there is a channel capacity, such that 
no further information transmission can 
be obtained if the number of categories 
is increased beyond that indicated by 
the channel capacity. 

The absolute judgment paradigm and 
the rating scale paradigm are in fact, 
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however, quite different, and the es- 
sence Of the difference is this: In the 
absolute judgment paradigm, the num- 
ber of stimuli and the number of re- 
sponses are varied concomitantly, while 
by the nature of the problem they can- 
not be in the usual rating scale applica- 
tion. In the absolute judgment case, if 
we have an information transmission of 
2.0 bits, we can interpret this figure as 
indicating that we can obtain just as 
good discrimination if we use four 
stimuli and four responses as if we use 
any larger number. Furthermore, we 
can and should select the particular four 
stimuli in such a way that maximum 
discrimination will be realized. 

The nature of the rating scale prob- 
lem, however, is that we have a’ fixed 
number of objects or samples to rate, 
and we are not free to make this num- 
ber smaller. If 20 different foods, or 
50 men in a plant, are to be rated, we 
are not really interested in seeing how 
well we can transmit information if 
fewer foods or fewer men are rated: 
we want information about each and 
every stimulus to be rated. 

As long as we are not free to select 
a small representative set of stimuli 
(i.e., cannot recode into a more efficient 
set), then we will need to have a large 
number of rating categories to obtain 
relatively small amounts of information 
transmission. If two adjacent stimuli 
on a discriminability scale are reliably 
different, then the rating scale must be 
sufficiently detailed to allow that differ- 
ence to be shown. 

The nature of this problem is easily 
shown with reference to the hypothet- 
ical set of data shown in Table 2. For 
this table, the contingent uncertzinty, 
or transmitted information, is 1.07 bits. 
Now we can easily select two stimuli 
and two responses which will allow us 
to obtain 1.00 bit of contingent uncer- 
tainty—we simply take the first and 
fourth stimuli and assign one response 
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TABLE 2 
AN ILLUSTRATIVE CONTINGENCY TABLE 
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0.04 | 0.21 
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Note.—Cell entries are proportions of joint 
occurrence, 


to each and the pattern of the data 
makes it clear that we can expect to 
obtain no confusions between these 
stimuli. Thus we will obtain 1.00 bit 
of transmitted information. 

Suppose, however, that we must keep 
all the stimuli, but want to reduce to 
two response categories. The best we 
can do is to group Response 1 with 2 
and also 3 with 4. But since the re- 
sponses to the second and third stimuli 
will overlap these two new response 
categories, there must remain some re- 
sponse equivocation, and we cannot ob- 
tain the maximum of 1.00 bit of infor- 
mation transmission. In this actual 
case, if the data from Table 2 are 
pooled to form the 4 X 2 matrix, the 
resultant information transmission is 
0.68 bit. 

Thus the essential difference between 
the absolute judgment situation and the 
rating scale situation is not in the char- 
acter of the stimuli, but rather in the fact 
that we feel free to select stimuli in one 
case and do not in the other. If loud- 
ness ratings were obtained under equiv- 
alent conditions, i.e., if we keep a fixed 
and large total ‘number of stimuli, but 
change the number of rating categories, 
information transmission would vary 
with the number of rating categories 
also. 

The small amount of available data 
suggest that the response information 
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(i.e., logarithm to the Base 2 of the 
number of categories) should be about 
5 to 10 times as large as the expected 
information transmission, at least for 
small amounts of information transmis- 
sion. For small numbers of stimuli to 
be rated, it is almost certainly worth 
using complete ranking rather than 
coarser scales. And as a last statement, 
it is clear that information transmission 
cannot be lost by increasing the number 
of rating categories. Therefore, it is 
better to err on the side of having too 
many categories than to err by having 
too few. 
SUMMARY 


The purpose of this paper is to show 
how information transmission measures 
and discriminability scales can be used 
to determine the optimum number of 
rating categories. An illustrative set of 


data were obtained for 20 samples of 
handwriting with 4, 6, 8, 10, 12, 16, 


and 20 rating categories. Information 
transmission measures, after correction 
for sampling bias, show a small but 
definite increase in discrimination up to 
the 20 categories used. Discriminabil- 
ity scale values were also constructed 
from the rating data, and the standard 
deviations of the scale values show the 
same increase in discrimination. The 
variance of the discriminability scale 
values can be used to estimate informa- 
tion transmission, and the estimates 
agree very well with the directly cal- 
culated information measures. It is 
pointed out that the optimum number 
of rating categories must depend on the 
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discriminability inherent in the partic- 
ular rated stimuli, and that the optimum 
number of categories will be large com- 
pared to the measured information 
transmission. 
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THE MULTIPLE ABSTRACT VARIANCE ANALYSIS 
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VARIABLES 
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ESSENTIAL NATURE OF THE METHOD 


Research in psychological genetics 
operates either on: (a) particulate 
behavior syndromes, e.g., Hunting- 
ton’s chorea, phenylpyruvic amentia, 
which are visibly either present or 
absent, and which can usually be 
directly handled by Mendelian con- 
cepts and analyses applied to familial 
observations; or, (6) apparently con- 
tinuously distributed behavior traits, 
the variance on which one can attempt 
to assign to various environmental and 
genetic sources. Some inferences in 
terms of the former can to a slight 
degree be made from the latter type 
of biometrical and population genetics 
observations through the mathemati- 
cal treatments of Fisher (1930), 
Haldane (1932), Kempthorne (1957), 
Mather (1949), Wright (1931), and 
others. However, psychologists would 
in general be content, and, indeed, 
greatly heartened, even to be given 
some reliable information about the 
nature-nurture variance ratios for 
the more important continuous hu- 
man personality and ability dimen- 


‘The writer wishes to express his great 
indebtedness for discussion and advice to the 
following (who are nevertheless not to be held 
responsible for conclusions) : geneticists N. E. 
Morton and W. J. Schull; geneticists and 
statisticians H. W. Norton and R. W. Touch- 
berry of the University of Illinois; and statis- 
ticians D. L. Burkholder, R. A. Fisher, Q. 
McNemar, and, especially, C. R. Rao of 
Calcutta. He wishes further to thank his 
issistants Merlin Foster, John Horn, and 
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sions. For the overwhelming majority 
of psychological measures are, as far 
as we can see, continuous, and the 
instances that can be directly an- 
alyzed by particulate Mendelian anal- 
ysis are negligibly few. 

Until a few years ago all published 
instances of nature—nurture variance 
analysis ratios for psychological vari- 
ables were based on studies of identical 
and fraternal twins (sometimes sibs). 
At that time the present writer (1953) 
put forward a design called the 


Multiple Abstract Variance Analysis 
method, which seemed capable of 


answering more questions than the 
twin method and of avoiding certain 
artificialities and limitations therein. 
Although developed independently in 
the field of psychological measure- 
ments, it proved on examination 
to be much the same in general 
principle as the estimation of com- 
ponents of variance used in agricul- 
tural and biological genetics. How- 
ever, the MAVA method (as we 
shall henceforth abbreviate it) con- 
tains features unique to the human 
cultural situation and to psycho- 
logical measurement. Moreover, it is 
new to psychologists, being exempli- 
fied’ in only two studies. The first, 
measuring personality factors by ques- 
tionnaire, occurred in 1955 (Cattell, 
Blewett, & Beloff), and the second, 
using objective, behavioral batteries 
for the personality factors, and a 
culture-fair measure ot intelligence, 
was published in 1957 (Cattell, Kristy, 
& Stice), 
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The results of these first applica- 
tions, in that they demonstrated 
diverse and characteristic nature— 
nurture ratios for well known person- 
ality dimensions, agreeing, moreover, 
with ulterior evidence, were encour- 
aging. It seemed very desirable, 
therefore, to develop further the 
potentialities of the method itself.’ 
This paper sets out to make a closer 
examination of the method, its as- 
sumptions, its calculations, and its 
needs in terms of more refined de- 
velopment. 

The title of the method derives 
from the fact that conceptual vari- 
ances are introduced, i.e., variances 
which can only be inferred or “ab- 
stracted”’ from actual, observed vari- 
ances. The label designates this 
abstractness in order to avoid con- 


fusion with the usual use of the 


analysis of variance design in which 
questions are asked purely in regard 


to concrete variances. It is “‘multi- 
ple’ because it views each concrete 
observed variance as the result of 
several, abstract, inferred variances. 
The MAVA method in fact postulates 
that the observed variance on a given 
trait, e.g., a personality factor score, 
measurable on a continuum, can be 
resolved into four contributory vari- 
ances as follows: 


? The writer has to apologize for five years 
of delay in presenting this fuller statement, 
occasioned by nothing more than writing 
difficulties in presenting a simultaneously ade- 
quate explanation of genetic concepts, for 
psychological readers, and of psychological 
measurement assumptions, for geneticists, in 
anything less than book length. The delay is 
regrettable because psychological genetic 
researches have been started in the interim on 
what is essentially an obsolete twin design, 
or researchers have been compelled laboriously 
to derive independently from the initial state- 
ment (1953) the formula now given in Tables 
1 and 2. This situation may perhaps justify 
the present necessarily uncomfortably con- 
densed presentation. 
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Variance through causes resident in the 
differences of environment between fami- 
lies. Henceforth written o*,,. 

Variance through causes resident in the 
differences of heredity between families. 
Henceforth written os. 

Variance through causes resident in the 
differences of environment for individuals 
within the family. Henceforth, a7. 

Variance through causes resident in the 
differences of heredity between siblings 
within the same family (gene segre- 
gation). Henceforth, 7s. 


Furthermore, it does not assume 
that these sources are uncorrelated 
but attempts simultaneously to de- 
termine what correlations exist be- 
tween these hereditary and environ- 
mental variances, both within families 
(henceforth, rwe.wa), between families 
(henceforth, rs..»,), and, occasionally, 
through the rare correlation of a 
within and between variance. Ob- 
viously, the equations below imply 
that heredity and environment act 
mutually in a simple additive fashion, 
without any ‘“‘catch all” category to 
accommodate to all kinds of inter- 
action effects, as usually conceived in 
analysis of variance. At the present 
stage of the model, one could either 
assume heredity and environment to 
be uncorrelated and then permit 
interaction, as in analysis of variance, 
or sacrifice interaction. The assump- 
tion of uncorrelated heredity and 
environment seemed to us, in the 
psychological field, less tenable.* 

A more ideal model would be one 
which both allows covariance, as here, 
and also interaction, but, while such 
a model is being worked out, the 
present model can be expected to give 
the usual approximate fit of a simpler 
model to a, possibly existent, more 


* Were it not that one sometimes hears the 
remarkable statement that “psychological 
traits cannot be wholly resolved into genetic 
and environmental causes” it might be un- 
necessary to add here that we assume that the 
variances and covariances are exhaustive of 
the measured trait variances. 
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complex natural pattern. Addition- 
ally, the assumption will be recog- 
nized that the values obtained for the 
variances and correlations are specifi- 
cally those existing in the given cul- 
tural and racial sample (referable, if 
desired, to parameters for a statistical 
population of that type), and that 
generalization beyond this, e.g., to 
modern, primitive, etc. cultures, is 
possible only in the light of additional, 
strategic, experiments. Since a com- 
plete statement of assumptions in a 
model can be endless, the reader 
should accept those obviously im- 
plied, while others will be mentioned 
later as they become relevant. 

The general manner of building 
equations relating observed to ab- 
stract variances will be evident from 
inspecting, say, Equation 4 in Table 1 
below. Here it is stated that the 


observed variance within a series of 
pairs of siblings reared together, o’sr, 


is the sum of within family environ- 
mental variance, plus within 
family heredity variance (due to differ- 
ent segregations of parental genes), 
or, Plus the covariance (if these 
influences are correlated), 27raweGwn. 
The calculation of the observed vari- 
ances, as such, will be discussed later. 
Originally only six such equations 
were written (Cattell, 1953), because 
it was presumed that there would only 
be six unknowns—four variances and 
two correlation coefficients. Thus by 
the solution of six simultaneous equa- 
tions, using the six experimentally de- 
terminable concrete variances—o’s7, 
asa, etc.—it was hoped to solve for 
the six abstract, conceptual values. 
However, six equations and six 
observed variances are not enough, 
because: (a) The last two or three of 
the six equations import other un- 
knowns. For example, one of our 
necessary equations was for twins 
reared together and, unless one makes 
the psychologically unsatisfying as- 


a 
Owes 





sumption that the typical environ- 
mental difference in the lives of 
twins is the same as that between 
sibs, a new term, for a special kind 
of intersib environmental difference, 
Oe’, Must be introduced. (6) Simi- 
larly it turns out that with additional 
experimental groups more than the 
above two heredity—environment co- 
variances are involved. (c) Linear 
dependencies systematically appear 
between the equations for “‘within,”’ 
“‘between,”’ and ‘“‘total’’ variances in 
each of the family types one uses, 
thus reducing the number of effective 
equations. 

The additional equations that may 
be used, and the alternative assump- 
tions possible in using various com- 
binations will be discussed after the 
following section, clarifying the calcu- 
lation of concrete variances. 


CALCULATIONS AND ASSUMPTIONS 
CONCERNING CONCRETE, Ex- 
PERIMENTALLY MEASURED 
VARIANCES 


Conceptually, the within family 
variance, on some such trait as in-°: 
telligence, surgency, etc., is the vari- 
ance (a) among the children, i.e., not 
including the parents; (6) of the 
typical American family (or whatever 
cultural or ethnic group one studies) ; 
(c) affected by size of family only 
statistically, i.e., to the extent that 
the variance. estimate uses different 
degrees of freedom with different 
family sizes (actually we took mostly 
pairs but occasionally trios, from 
which .to estimate this variance); (d) 
corrected for age differences, i.e., it is 
the variance that would exist, on 
traits susceptible to age change, if all 
sibs were brought to the same age; 
(e) corrected for experimental error of 
measurement, but mot for function 
fluctuation. 

The justification for using the con- 
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cept thus is partly experimental 
practicability and partly the achieve- 
ment of maximum negotiability in 
further theoretical or calculation use 
of the obtained variances. Objection 
can be raised to (c) that large families 
might have, for example, smaller 
variance, on certain traits, due to 
more powerful group “coercion to the 
bio-social mean’ (Cattell, 1957). 
This can be investigated by our 
design, through an experiment with a 
sample of small ‘size families con- 
trasted with one of large size. Mean- 
while the adopted assumption is 
simpler. As to (d), it is true that by 


‘ 


Oo"; 
[rue 
(Individual Difference 
Variance 


Although the allowance for error of 
measurement, derived from the de- 
pendability coefficient of 


the test 
(Cattell, 1957) by o%, = 1 — fra, is 
always only an estimate, it is better 
than not allowing for it at all. The 
question whether function fluctuation 
should be taken out depends on one’s 
conception of environment. Periodic 
or aperiodic, short-interval, function 
fluctuation occurs in most personality 
and ability measures (Cattell, 1957; 
Fiske & Rice, 1955; Thouless, 1936) 
from internal or external causes. 
Should one say that the ergs of 
hunger or sleep or the anxiety reac- 
tion are not innate because they 
fluctuate hourly? The writer is in- 
clined to the view that over a sufficient 
series of standard situations a reason- 
ably stable strength level could be 
assigned to an individual on such 
fluctuant traits and that it is this 
level which we speak about when we 
talk of individual differences, e.g., in 
anxiety. For most purposes it is the 
nature—nurture contribution to this 
relatively stable “over a couple of 
months” value, at various age levels, 


Observed 
Variance 
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allowing an age correction we are al- 
ready here using an “abstracted” 
rather than a literal, concrete vari- 
ance; but there is no point in con- 
founding psychological variance with 
age variance. If one knows the age 
variance of families in a given culture, 
the concrete variances can always be 
readily “restored” from the obtained 
variances through this more analytical 
approach and, meanwhile, the nature— 
nurture ratio is more pure, comparable, 
and comprehensible for an age-con- 
stant group. 

The issue in (e) is best seen by 
setting out the equation: 


9 ° 
oe os 


Function 
Fluctuation [1] 


Error of 
Measurement 


that. we are interested in. Function 
fluctuation variance is calculable from 
the difference of the dependability 
and stability coefficients (Cattell, 
1957), but, for lack of the latter it was 
not actually taken out of the variance 
in our experiments (Cattell, 1957; 
Cattell, Blewett, & Beloff, 1955). 
When not taken out it theoretically 
swells the environmental variance, and 
it is significant that the distribution 
of ratios for two dozen personality 
dimensions in our studies was de- 
cidedly skewed to the environmental 
predominance end. If experimental 
error is not taken out, it also pre- 
dominantly swells the variance as- 
cribed to environment. Ideally, there- 
fore, both should be taken out before 
calculating the nature—nurture ratios. 

The assumptions and problems in 
the estimation of the observed be- 
tween-family variance are essentially 
of the same nature. Again, one is 
estimating a value from the sample for 
a statistical population assumed to 
be homogeneous with the sample, 
and, in this case, the correctness of (b) 
above—that the statistical population 
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is the typical American population 
is harder to justify in the circum- 
stances commonly prevailing in na- 
ture-nurture studies. For, without 
quite extraordinary exertions by 
the experimenters, a disproportionate 
number of cases will come from social 
welfare agencies. Obviously, the ideal 
requirement is a k-way classification 
of families by religion, occupation, 
age of parents, economic, social, 
education, etc., background of parents, 
etc., corresponding to a stratified 
population sample (or, when investi- 
gating the effect of particular condi- 
tions, e.g., religion, on the ratios, a 
deliberately selected sample). 
Another specification which must 
be watched in defining and using the 
various variance ratios from the 
MAVA is whether the differences con- 
sidered are between boy, girl,‘ or 
boy-and-girl families. Even in the 
third case the equation for identical 


twins must, naturally, remain that 
for same sex pairs; but, provided 


oe has been used for the special 
between-twin environmental variance 
these same sex pairs can be used in 
the total solution. 

As to the degrees of freedom, the 
within-family values, when pairs of 
children are taken (deviation of each 
from their common mean, squared and 
added over k families), will be &. 
When 2 children instead of two are 
taken from each family it will be 
k(n — 1). For between-family vari- 
ance, obtained by subtracting the 
grand mean of all (sib) families from 
the: mean of each family, squaring 
and adding, it will be k — 1. 


CONCEPTS AND ASSUMPTIONS 
IN THE EQUATIONS 
It will be evident from the initial 
illustration by the case of within- 


* Not only environmental, but also heredi- 
tary within family variances could be different 
for girl-girl pairs, due to the extra chromosome. 


family environmental variance above 
that the experimenter in this area is 
placed in a theoretical dilemma pe- 
culiarly embarrassing to a perfection- 
ist. As he adds more equations in the 
hope of more completely determining 
unknowns, he adds new unknowns. 
In the writer’s opinion one can achieve 
a complete solution in the sense that 
the number of experimental situations 
capable of yielding new equations 
eventually overhauls the number of 
unknowns. But since it is not be- 
yond the wit of man to invent even 
finer conceptual subdivisions of family 
types and situations to which he 
would like a separate answer, let us 
admit that the process is asymptotic. 
Consequently, it is up to the experi- 
menter to draw a line where he will in 
data gathering, in the light of present 
estimations of magnitudes of various 
influences and the steps dictated by 
research strategy. 

Accordingly, the presentation is cut 
to two designs, one, called the Limited 
Resources Design, which makes grosser 
assumptions in order to accommodate 
to the resources of nine out of ten 
investigators, and the other, called 
the Complete Design, which aims at 
what, by ordinary standards, can be 
called a complete solution, regardless 
of difficulty or expense. The assump- 
tions in these two designs can best be 
discussed when we have considered 
the symbols used, as follows: 


Abstract Variances 


1. oe = trait variance (between sibs) due 
to within-family environmental 
(we) variation. Incidentally, we 
assume that this variability of 
environment among children raised 
in the same family is essentially 
the same for all kinds of families, 
including normal and adoptive 
families. On the other hand in 
the special case of fraternal twins, 
who are of identical age, and 
identical twins, who, according to 
clinicians, have an urge to be more 
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alike or (in other opinions), more 
individualistic, we enter with spe- 
cial terms capable of discovering 
such differences, as follows: 

trait variance within family asso- 
ciated with type of environmental 
difference peculiar to fraternal 
twins. 

trait variance within family asso- 
ciated with type of environmental 
difference peculiar to identical 
twins. 

trait variance due to  within- 
family heredity (wh) variation. 
trait variance due to between- 
family environment (be) variation. 
trait variance due to between- 
family hereditary (bh) variation. 


Another possibility—-that the with- 
in-family environmental variance 
among sibs might be different in 
natural and adoptive families—has 
not been given separate representa- 
tion at this point. 


Correlations 


7. Twhwe = correlation between effects of 
within family heredity and within 
family environment (siblings 


A systematically different value for 
this correlation might arise in special 
circumstances, notably (a) when the 
two sibs are of the same age, i.e., 


fraternal twins. For example, a trait 
which might be fostered in a younger 
by an older brother might be in- 
hibited in the interests of competition 
when they are the same age; (b) when 
it is an adoptive family, i.e., not true 
sibs and raised by foster parents. 
For, whether we believe with G. B. 
Shaw that parents are allergic to their 
own hereditary defects appearing in 
their children, or take a more kindly 
view that they will be more under- 
standing, the fruews will at least be 
different. Moreover, the hereditary 
variance in adoptive (e.g., foster) 
families is typically greater than for 
ordinary families, being a sum (or 
other combination) of within and 
between hereditary variances. If 


there are reasons for r settling down 
to a certain value with a certain 
variance, it is unlikely to remain at 
the same value when the range of one 
variable is altered. Actually we have 
assumed here that increased variance 
of heredity (Equation 6, Table 1) and 
of environment (Equation 5, Table 1) 
affect the rwr.we term in the same essen- 
tial fashion and we have represented 
both by a single new r as follows: 

8. r'wh.we = correlation of heredity and en- 

vironment in adoptive families. 


In such families also, it will be 
noted, we have the peculiar condition 
of a possible correlation between a 
between-group and a_ within-group 
variance, in that the foster parents 
may, for example, treat the children 
differently according to the hereditary 
variations appearing from the fami- 
lies from which they derive. Hence: 
9. r'vedbh = correlation of within-environ- 

ment and between-heredity pe- 
culiar to adoptive families. A 
prime is placed on this r, 
though there is no other fue. 
from which to distinguish it, 
because it helps classification to 
have all r’s peculiar to adoptive 
families marked with a prime. 
correlation between effects of 
between-family heredity and be- 
tween-family environment. The 
frequently discussed correlation 
of “‘good”’ heredity with “‘good”’ 
environment among families is a 
possible instance. 


There follow two possible correla- 
tions, probably nonzero only because 
of what may be called ‘placement 
agency” effects. In our society this 
arises partly from the deliberate 
policy of welfare agencies of placing 
more able children in families likely 
to give them a better educational 
environment, but in most societies 
it might arise from a tendency to 
adopt orphaned relatives. These place- 
ment agency r’s are marked by a 
double prime, as follows: 
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i... correlation 
peculiar to sibs raised in differ- 
ent families, where  within- 
family hereditary differences 
may become correlated with 
between-family environment by 
deliberate intention of social 
agencies in placing, e.g., brighter 
sib in more educated family. 
placement agency correlation, 
through any systematic tend- 
ency to match biological family 
characters of adopted children 
to environmental characters of 
foster parent families 


placement agency 


Between four main variables 
wh, be, bh—there are six possible 
correlations (more, with special vari- 
which four would 
normally be dropped on the assump- 
that no correlation commonly 
exists of within and between vari- 
However, having found two 
exceptions we should look for more. 
The possibility that ruron and Swede 


we, 


ables above) of 


tion 

ances. 
could be nonzero is however, hard to 
For example, it is difficult 


to see any way in which an individual's 
deviation in heredity from his family 


conceive. 


mean could be connected with the fact 
that his family differs in mean from 
the community hereditary mean. On 
the other hand, if, say, specific “‘roles’’ 
are more emphasized in highly than 
poorly educated families, the possi- 
bility that an_ individual’s 
deviation environmentally from his 
family mean will be greater when his 
family is environmentally most posi- 
tively deviant from the community 
mean. This would give a significant 
curvilinear correlation but no signifi- 
cant linear correlation. This will be 
recognized as another restriction on 
our model, that it does not respond 
to curvilinear correlation among vari- 
ances. But it does mean. that no 
especial reason exists for complicating 
our equations with an fiewe term. 
We are left, therefore, with two 
“main” (r) correlations, two ‘‘com- 
ponent” (r’) correlations, and two 


exists 
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(r’’) correlations requir- 
ing terms for solutions. 


“placement” 


If space permitted, a number of 
other interesting theoretical possi- 
bilities could be discussed here, though 
most would be rejected as highly 
improbable. What may seem an 
undue refinement of conception, but is 
nevertheless vital in later interpreta- 
tions and applications, is implicit in 
the above labeling of within and 
between-family hereditary and en- 
vironmental variances as “effects’’ 
not “influences.” It would not be 
correct, for example, rigidly to reason 
that all environmental variance within 
families comes from influences within 
the family or all environmental vari- 
ance between families from influences 
within the community operating differ- 
ently on these families. Similarly the 
correlation of a within-family heredi- 
tary deviation with a within-family 
environmental effect does not imply 
that the correlation is brought about 
entirely by influences within the 
family. Tracing influences is a second 
step after locating effects. 

To determine the 12 unknowns 
above (6 variances, 6 r’s), which we 
have found it necessary to introduce 
on theoretical grounds in the Com- 
plete Design, one must employ, for 
reasons stated, more than 12 equa- 
tions. The following 16 experimen- 
tally determinable variances, each for 
a particular kind of group, are re- 
quired to represent the left side of 
(The 
equations themselves are set out in 
Tables 1 and 2). 


each of the equations needed. 


variance in the trait as meas- 
ured among identical twins 
reared together by their 
parents. 


. CTT 


variance in the trait as meas- 
ured among identical twins 
reared apart. 

variance in the trait as meas- 
ured among fraternal twins 
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raised together by their true 
parents. 

= variance in the trait as meas- 
ured among ordinary siblings 
reared together by their 
parents. 
variance in the trait as meas- 
ured among ordinary siblings 
reared apart. 

= variance in the trait as meas- 
ured among unrelated chil- 
dren reared together in one 
family with foster parents 
who are biological parents of 
none. 
variance in the trait as meas- 
ured among unrelated reared 
apart (general population). 

= variance in the trait as meas- 
ured between the means of 
identical twin pairs in their 
true families. 
variance in the trait as meas- 
ured between the 
identical twin 
apart. 

= variance in the trait as meas- 
ured between the means of 
natural families, i.e., means 
of sib pairs (or trios, etc. 
reared together. 
variance in the trait as meas- 
ured between the means of 
biological families whose sib 
lings are reared in different 
foster homes. 
variance in the trait as meas- 
ured between the means of 
social families, i.e., adoptive 
families of biologically un- 
related children. 
variance in the trait as 
measured among half-siblings 
raised together. 
variance of half-siblings raised 
apart. 
variance in the trait as-meas- 
ured among means between 
half-siblings raised together 
by one true parent. 

= variance of means between 
half-siblings raised apart. 


means of 


pairs raised 


2 
+ O BBP 


2 
o°HST 


2 
+ OHSA 


. ognstTr® 


. ao BHSA Fr 


Other familial groups yielding ex- 
perimentally determinable variances 
are theoretically conceivable, but they 


5 A further specialization can be made here 
according to whether child is raised with 
mother or father. 


demand too much tact on the part of 
the experimenter in locating them, or 
too much moral aberration on the 
part of the community, to be prac- 
ticable! 


EQUATIONS FOR THE COMPLETE 
SOLUTION AND THE LIMITED 
RESOURCES DESIGNS 


The 16 equations which link’ the 
above experimental, concrete vari- 
ances with the abstract variances and 
correlations seemed to produce them 
are shown in Table 1 below. We 
assume they will require no discus- 
sion, beyond the illustration already 
given of a typical equation (Equa- 
tion 4) for most readers. Perhaps a 
few words on a more esoteric example 

Equation 13, half-sibs reared to- 
gether—will complete the explanation. 

Here we might have two children 
of one mother by two fathers. I nci- 
dentally, introduction of a different 
term for children of two mothers by 
one father is not considered necessary 
at this stage, nor is the variance con- 
sidered modified by the different heads 
of family in which the two half-sibs 
might be brought up together. They 
will differ by (a) within-family hered- 
ity variance, o*,,. For each is only 
one ‘“‘choice’” among the possible 
parental gene segregations; (b) also, 
in this case, by half the interfamily 
hereditary variance, }07,,, for one 
parent is the same and one different ; 
and (c) by the usual within-family 
environmental differences, 7... In 
terms of covariances there will be (d) 
a “component variance” correlation 
of the between-family hereditary vari- 
ance, (which is now part of the total 
within-family environmental — vari- 
ance) with environmental variance in 
the family. This produces the co- 
variance term, V2r'wesrtwetrn. (The 
coefficient V2 before this is derived 
from 2/v2, the usual coefficient 
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rABLE 1 
[HE SEVENTEEN GENERALLY PRACTICABLE EQUATIONS FOR THE 
COMPLETE MULTIPLE VARIANCE DESIGN 


Note The arrangement of equations in Table 1 can, for purposes of simplification be regarded as creating 
two blocks: 1. 1 through 12, which are constellations of ordinary sib and twin families, and which divide into 
within variances (1 through 6) and between variances (8 through 12), and, 2. constellations in half-sib families, again 
the first half being within and the second between variances. The matching of each between with its correspondin 
within is indicated by (Suppl. ““X"’)—Supplementary to the earlier numbered equation—written beside the matche 
equation 

(1) 
(2) 
27 wh-weF whT we or replace last term by 2rwh.we'FwhO we’) [3] 
27 wh-weF whT we [4] 


h-weF wh wel + 29 wh. be wh be | 


; >? 
wh-weFwhF we T <T we-bhOweT bh 


Ve 
+ 2 wh-weF whT we 27 bh-beT rT be 


Suppl 


o* BITTF 3. hae t ‘, 2he + 2V2rwh.weTwhT we + AT br-beTbAT be 


(Suppl 
opiTAr = 2a% rh + o%we + 2074 + Orbe + 2V21' wh-weFwrhTwe + 2V21' br. beT oA be 


+ 2V29' we-brO we br + 2V29" wh. be oF be] 


r a - : * , : ) 2 ) 7 
lo* grrr o*« Cre t+ 2a75n + 2orbe + 2fwh.we'’FwhT we’? + AT bA-beTbAO be | 
(Suppl. 4 or 3 and 4 
o'pnrP = o*y . 207 5h + 2o%be + 2fwh-weFT wT we + A bh-beTbAT be 
(Supp! 

° > 9 Pa 
o* 2 BF Grek + Crue + 2o7n 4+ + h-weTwhFwe + 2V2F' we-bhOweF bh 

+ 2V2 9 bh. beT bn beL + 
(Suppl 


o’ BSP teh “we *5 2 wh-weT whT we 
2V2 9" wh. beF whO be | 
o787 wh + 7, we wh we + V2T' we-bhTweT dh 
2 2 2 + 7r 4. Wr 
C°HSA w “lr wh-weT wh? we “lbh. beT DAT be 
(Suppl. 13 
, ' e 
o* BHSTF “s t “we 2a7 te + 2rwh-weF wh we + 2v3 ron beT bAT be [15] 
Suppl. 14 


o* BHSAF ‘wh + Owe + wh weF whF we + VOT bd. beT br be [16] 


2 being multiplied by the root of are similarly derived and from con- 
a half because o*,, is present only _ sideration of each it will be evident 
as a half variance.) (e) The usual that a good deal of careful reasoning 
covariance of heredity andZenviron- from first principles, and discussion 


ment, 2fwewhTweTwr- Other equations among experts, has gone into them. 
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TABLE 2 


TEN Equations USED IN THE LIMITED REsouURCES DESIGN 


+ owe! + 25 wh-weF wh we’ 


T O'we T re h- weT whO we 


Owe o be + 27 wh- we? whF we 


2 


2 


o*UA = oy, + Owe 


2 , P 
o'we T Obs T Zr wh-weT whT we + 2r we- bhT weT bh 


an 
+ a5, + ore + 2wh-weFwhFwe + 2 bh-beTbAT be 


opitre = 2a%wh + orwe + 2o7%n + 2o%%—e + 2V2Fwh.weFwhTwe + A bA-beTbAT be 


o* BNP 2 


o* BBP 


o'* Bsr 


To get the systematic organization 
of terms needed in such a presentation 
we have followed the rule of putting 
heredity terms before environmental 
ones, variance before covariance, with- 
in-family before between-family. The 
primes and double primes on r’s are 
explained above. Occasionally, where 
a mathematically possible combina- 
tory term might also seem needed by 
the reader, but has been dropped by 
us on fuller consideration, it has been 
placed in brackets, i.e., strictly we 
regard the term as unnecessary. We 
have deliberately refrained from work- 
ing out as decimals the various 
numerical coefficients, since their rep- 
resentation as radicals enables the 
reader more easily to perceive their 
derivation. 

The equations of Table 1 are pre- 
sented as a totality of experimentally 
available equations, from which an 
experimenter may draw according to 
the research resources available to 
him and the degree of simplification 
of assumptions which he cares to 
make. Even the Complete Design 
cannot simultaneously use all of them, 
the linear dependencies 
though this does not 


because of 


noted above, 


- 2 + 2 ade 2 4. Ja2 A. 3 L. De’ -2 ty’! 
o"wh Owe T Fbh T £0*be “l wh-weF whF we TT LT we-bhOueTbh TF VLI bh- beT AT be 


2 —_— oe 2 ? 4 
O*wh 1 Owe T LO°bh TP LO%be TP Ll whe weFwhFwe T 41 dh-beTbhT be 


3 : 2 2 , ” r 
= oun + o%we + 2o7en + orb + 2eh-weF wh we + 2V27' webhO weer + 2V2r' on.reorntte [9] 


[10] 


avoid collecting the data that would 
be required to use virtually all of 
them.* The 11 equations which we 
advocate as most effective for the 
final determinations of all 11 un- 
knowns in 16 are: 1, 2, 3, 5, 6, 7, 8, 9, 
10, 11, and 14. The only regrettable 
waste in this choice is that it does not 
use sibs raised together, Equation 4, 
an easily obtainable group, but the 
alternatives are worse. For example, 
half-sibs raised together are needed in 
most others, and these are socially 
difficult to identify, with scientific 
reliability. 

The costly features of the Complete 
Design are identical twins reared 
apart and half-sibs reared apart, both 
of which require much effort to locate. 
Accordingly we have suggested for 
most the Limited Re- 
sources Design in Table 2, which uses 
only 10 equations and requires only 
sibs, and, in twins, those reared to- 
gether. A sufficient sample of these 
should be locatable by any investi- 
gator near a large city. It is not 


researchers 


*Commonly, one has the choice of using 
either the within or the between variance 
from data gathered on a particular type of 
familial group. 
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possible to create this desigii sumply 
by lifting 10 equations from Table 1; 
for the assumptions that are made in 
each have to be changed in order to be 
able to reduce the total number of 
equations. The simpler assumptions, 
(a) consider the environmental vari- 
ance for identical and fraternal twins 
raised together to be identical, as co”, ; 
(b) omit differentiation of r’ or we and 
, the modification of the en- 
vironment—heredity correlation when 
the hereditary and the environmental 
variance are increased in families, re- 


Twh.we, 1.€. 


spectively, of unrelated children raised 
together and sibs raised apart. Thus 
it solves for five variances and three 
correlations, instead of for six vari- 
ances and five correlations as in the 
Complete Design. 


THE SOLUTIONS AND THEIR 
CONFIDENCE LIMITS 


Since the solutions of the above 


equations are a considerable under- 


taking—mainly because of the need 


to choose the best sequences in com- 


bining them—it would be helpful 
to have tables set out for the general 
solutions. The results of some three 
or four months of algebraic work and 
checking are set out in Tables 3 and 4, 
in which the researcher can simply 

experimentally-obtained, 
variances to obtain the 
variances. In order not to 
interrupt the main presentation these 
have been placed at the end of the 
article. Quite apart from the sub- 
stantial algebraic task there are cer- 
tain theoretical problems in solving 
the equations of Table 1, which have 
been discussed in another context by 
Nelder (1953). Some of these are 
approached below. 

Additionally to the solutions in 
Tables 3 and 4, which are only as re- 
liable as human pursuit of long ma- 
nipulations of simultaneous equations 


insert his 
concrete 
abstract 
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can be, and should accordingly be 
checked before use, solutions have 
been worked out along different lines 
on the electronic computer. A ma- 
trix, A, consisting of the coefficients 
for variances and covariances arranged 
with rows corresponding to the 11 
equations from the Complete Design, 
is post-multiplied by a Vector X of 
abstract variances and covariances to 
yield the vector of concrete variances, 
B, thus 
AX =B 


Solving for X would give the ab- 
stract variances, thus, 


X = AB 


But, since we wish to compare the 
coefficients for the concrete variances 
in these solutions with those in Tables 
3 and 4, which have to be utilized 
in the following discussion on stand- 
ard errors and confidence limits, we 
stopped the solution short at obtain- 
ing A~'. This was done by the fol- 
lowing transformation because the 
Limited Resources Design, unlike the 
Complete Design, does not present a 
square matrix of coefficients 


A = (A’A)"A’ 


These matrices of coefficients may be 
obtained by application to the author. 
The principal incompleteness of the 
MAVA method as first applied (Cat- 
tell, Blewett, & Beloff, 1955) to 
psychological data was the lack of 
standard errors and confidence limits 
for the values obtained. As Tables 3 
and 4 remind one, each of the abstract 
variances finally obtained is a func- 
tion of quite a number of experi- 
mentally-determined variances, while 
the correlation coefficients have at 
least an equally complex derivation. 
At the present juncture in research 
it becomes important to attempt to 
solve the complex problem here pre- 
sented. An additional complexity 
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rABLE 3 


SOLUTIONS FOR Six ABSTRACT VARIANCES AND Five HEREDITY-ENVIRONMEN1 
CORRELATIONS, IN THE COMPLETE DESIGN 


Note.—For convenience, certain equations below, namely, those having a quadratic solution, are set out in 
triplicate. The first statement shows the roots which give possible positive and negative entries, while the follow- 
ing “a” and “‘b’’ equations give the actual resulting alternative answers—and in decimals since there is no longer 
need to reveal the mode of derivation. In these cases both solutions are retained until further considerations cause 
one to be ruled out 


> 


Sv2e* sirrr—o*sa+.SVlo* pir ar —.SV20* par t+o* 7a + (.5V2 —1)o* ane 
70700 girrr —o*s4+.70700* gir ar —.70700* par +0774 —.29300% ane 
7070¢e? girre —o'sa—.10700* grap +.70700? pap t+o*;7ra -- 1.70700 ewe 


> 


2 2 (gq? 
oP wh-weF wh — \O 


uh —O PTT) —Pwh-we VAP wh-weP wh — 407% wh +40 wre PTT 

o1TT 

5+.5v2 o BITTE — 5vJe"s4 T So" piraAF . So* par » fi 5+.5v2 o'pwrt+.5V2e";7r4 

1.2070e* pirr: 707002 54+.50°% g1rar —.5e* apr — 1.20700? pwr +.707007 74 

— .2070¢? pirre +.70700% 54 +.50*% gir ar — So? apr +.20700* awe —.707007 74 
5vle* girrr +o “ 5v2e* pirar+.5V20* ger + (1 —.5v2)0* ane 
=.70700* girrr +o*sa— 207 .107007 grr ar +.707007 per +.29300* ar 

— 70700 girrr+oe'sa— r 7070¢? gi rar —.70700? gar +1.70700* ener 
to? girrre t+ V2e*sa4t " opitar +o per —o* ane — (2V2+2)e7%784 
=o pirre +1.414le*s4+3. ty a—o* gitar +o’ par —o' Byer —4.82820° 484 
=o? girrr — 1.414107%s4+.585907, o* gitar +o sar —o' pwr t+.8282e7n784 
(.25v2+.5)e? 74 —.250* airre t+ (.25V2+.5)e* 84 


*s4+.Se*airar —.5o* apr — (.5+.5V2)o° ane +.SV207 174 


—.25¢* g:rar+.250* sar 
XK V2e?n4—.5Vle*girre —o?sat+.5 Vie gitar —.5Vle* par t+o*rrat (.5V2 —1)o* ane 


(¥2+1)e*v7sat+ 1 — .25v2)e*anwr— 5v2Je*s4— (.2 + 5v2 a VA 
V2e*v4—.5V20* girre—o*sat+.SV20* Bir ar —.SV20* Bar t+o71r at (.5V2 —1)o* ayy 


+ (.5—.25v2)e*pirart+ (.25vV2 —.5)e*pirre + (.25V2 —.S)o* sar +.5e*%ur 


[9] 


X Ve®sirre+2e%sat+ (2+ V2)0°v,—o* errart+o*aer—o'anr — (2V2+2)e*n84 


t 


* For the sake of simplification it is best to determine values for rws.we and o4vs and substitute these in this 
equation to determine o*%ve’. 

> Solutions 12 through 16 are not directly for any required end value—there are only 11 real unknowns. They 
are introduced as conveniences in the process of solution. They consist of phrases containing radicals, and were 
treated as single unknowns in the solution of the original equations. The correlations rwa.we etc., can be directly 
obtained for these solutions by dividing the solution obtained by the product of the corresponding squares of the 
variances indicated on the left-hand side of the equation. Or, if it is desired the four solutions (7-11) may be used 
and by direct substitution, the values for rwa.we, etc., can be obtained. In Equations 7 through 11, only one solution 
is given 'to save time and space If the plus value of 2 is used in obtaining the variances (Equations 1-6) con 
sistency must be followed by using plus values throughout. 
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TABLE 3—Continued 


_ 5 +.25v2)e*a;rar— (.5+.25v2)e*airre— (.5+.5V2)02 v4 — (.5+.25V2)0* ear 


lbh-be = = = - = : ——— 
V.5vV2e°pirre+o%sa —20*va—.5V20°Birart+.5 Vo ner + (1—.5V2)o*anr 


— (.5+.5v2)e7s4+(V2+1)e*7s44+(.54+.25V2)c* ane af [10] 
x Vo8 grrr +Vletsa+ (2+V2)0%va—o'srrar+o*Bar—o' ene — (2V2+2)0%nsA ‘ 
onsat(.75v2—.75)e2 awe + (.25V2 —1)e?2s44+ (1 —.5v2)0?1 at (.75v2—.25)o* sir ar 
V.5V2o'sitre +o%sa— 20% 4—.5V20* nr ar +.5V20*xer + (1 —.5V2)0* Br 
+ (.25—.75v2)e?arre t+ (.25—.5v2)o* par —.5e*ur 
X Vo'girre+Vl0%sa+(24+V2)0%va—o* ai rar to par—o* pyr — (2V24+2)0*n84 
— = 250? pwr — (.25V2+.5)e?;74 —.250°%arrret+ (.25V2+.5)e*s4 
— (.25v2+.25)e*prrart+(.25v2+.25)c* ser 
25e* BNF 853567 7A 2c girre t+. 85350784 8535e° grap +.853507 ger [12a] 
2507 ays 146502 ;r4—.250% grrr} 14050754 —.14650% e;rar+-14650% gar [12b] 
25,\2-+ 
~ (.25V2+.25)e°airre—.25e°wirar+.25e*ser [13] 
Ycheaties 85350? 94 —.853507% aarp +.6035¢7 BNF 6035e* Birrr 25e* girart+.25 [13a] 
r’ ch-weF whl we =.146507 54 146507 gsr —.1035e? awe +.1035e°%grrrr —.250*% Brrart+.25 [13b] 


” we bhT weF bh V2 +le’asat (1—.25v2) oc awe —.5V20* 54 — (2 


“4 2 c c c 9 c 7 
+ (.5—.25v2)o*a:rart (.25V2 —.5)o* airrr + : -)) r+.507 7 [i+] 


2.4141e?ys4+.65650* ge 707007 5 4 — 2.70790? 7 4 +.1465e07 gi rar 
14650 grrr —.1465e°sar+.5e*vr [14a] 


Py sa t1.3535e07% gy ¢ +.70700% 54 — 1.29300? v4 +.85350*% Bitar 
- 853507 airre 853507? gar + 5e?, T [14b ] 
25v2)o* BIT AF 5 +.25v2)e* eirre — (.54+.5v2)e*va 
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8535e? errr — 1.20700? v4 —. 85350? par — 1.207007 54 
+2.4141e?y754+.85350* Bwr 


Tbh beTbhO be 14650? a rar — 14650 grrre +.207007 vy 4 — 14650? per +.207007 54 


4$141e*ys4+.46500? snr [15b] 


1” bhebeTbAT be as ; —.7! (.25v2 —1)o*s4+(1—.5V2)e? 


UA 
25 —.75v2)e2pirret+(.25—.5V2)eaer—.Se*ur [16] 


1 bhebeFbhOhe = 02H SA +.310607% BN er —.646502 54 +.29300? v4 +.81060* BT AP 


~.81060% irre —.45700* gar —.5e°ur [16a] 


” the beTbiThe =OOHSA 1.8106¢’ ewe 1.3535¢7%s4+1.7070e? v4 — 1.3106¢* gitar 
+1.31060°g;rrr¢+.95700 sar —.Serur [16b] 


lies in the fact that most investigators ous ratios between these, or between 
(including the present writer), have certain sums of these, e.g., the ratio 
usually not been content with the of total environmental to total heredi- 
values for the four main variances tary variance. For these final ratios 
but have proceeded to calculate vari- are more likely to express the ultimate 
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TABLE 4 
SOLUTIONS FOR THE FIVE VARIANCES AND THREE CORRELATIONS IN THE 
‘ LIMITED RESOURCES DESIGN 


(2—vV2)e?srt+o%sa4 ot urt (2V2 —2)e°v4—o pire +(2—V2)e*anwr+o*aar—o'asr 
5$859e%s7r+e'%s4—o? yr +.8282ev4—o' airre +.5859e ane +o per —o' Bsr 
Owe =3.414lesrt+es4—otyur—4.818leu,g—oerrret3Ald4leaner+oaer—o' esr 
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+-.7070¢07 pear —.70700 gsr 
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(Square root sign continued 
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® These variances (Subequations a and b) are given solutions wherein V2 assumes a value of 1.4141 in la 
throughout, and wherein y2 assumes a value of —1.4141 in 1b. 

» Solutions 10 through 14 are for phrases containing radicals. These phrases were treated, as in Table 1, as 
single unknowns in the solution of the original equations. The correlations ret.we etc., can be directly obtained from 
these solutions by dividing the solution obtained by the product of the corresponding squares of the variances 
indicated on the left-hand side of jhe equation. Or, if it is desired the four solutions (6-9) may be used and by direct 
substitution, the values for rwa etc., can be obtained. In Equations 6 through 9, only one solution is given 
to save time and space If the plus values of 2 are used in obtaining the variances (Equations 1-5) consistency 
must be followed by using plus values throughout 
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TABLE 4 


l wh- we wh? we’ = 


+(1—.5v2)o*pwrt+(v2—1)e?y44+(.5—.5 v2) 02 97+.25vVJe% ur 


=. Se err —.Se7 TT 


+.29300? pwr +.414loe?y4 —.20700?% 97 4+.3535e% 7 


So*prr—.So*:7T 
+1.70700? pwr —2.4141e?v4+1 


* - 


- Sa*s7+ (.5+.25v2) ce? ur— | 


+.( 5+ 


— 50? airre+.353507s4+.35350° BBE 
? 


0700? s7 —.3535e°% vr 
5+. 


25v2)e*asr—.Se*anrt+o*va 
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-Continued 


+.25v2e'* asr— .25Vle*ur 


K V(1+.5v2)e7%s44+(1 +.5v2)o"p ar —2e*vato*anr— (1 +.5v2)e2 asp —.5Vle2ur 


Se err — Serr —.dso*airrr —.25V207s4—.25Vl0*part+.25V2e' esr 


[107 


= 5e*piTTr - .3535e7 84 = 353507 par + 35350 Bsr 


[10a] 
35350 Bsr 
[10b] 


5v2)e?s4—(.5+.25vV2)o* sar 


(11) 


Se? 57 +.85350% yr — 85350734 —.85350°? ger t+.853507% esr —.S5e*anrt+o*va [ila] 


— Se? sr +.146507% ur — 1465075 4 —-146507 per +.14650% asp —.50*awrt+o*va [11b] 


25¢7s4 


204 —. 5o* pwr —. 85350 par t+. 853507 Bsr 
646567? 54+1.50?v4—.50° pwr —.14650*% gar +.146507 B55 


bhed v2e"; 2 « Svle* ave +.25vVie'psr—.25Vle* vr 


1” shedeObhote = 1.414102y 4—.70700" 5.4 


1 bhebeObhT be = 


inferences in which one is interested, 
as a starting point for further theoreti- 
cal psychological developments and 
experiments. Yet determining the 
distribution and the confidence limits 
of ratios presents an additional chal- 
lenge to statistical inference. 

The typical form of any solution 
for an abstract variance, as in Tables 3 
and 4 above, may be written: 


+ ko’, 


[2] 


where 6, is the estimated abstract 
variance, e.g., within family environ- 
ment; k&,, ks, etc., are numerical, 
constants; and o?;, etc., are the esti- 
mated concrete variances for particu- 
lar family relation population species. 
Our need is to determine the variance 
of the variance on the left, and to do so 
it is necessary to decide witéther the 


6, = kyo", + kyo: «+: 
2 i i ~ - 


250° gar +.250* gsr 


-. 70700? gnr+.3535e° asp —.3535e0° ur 


-1.4141e?p4+.7070 6% ¢4+.70700? pyr —.3535e% gsr t+.3535e rr 


(12) 


‘UA— 50° BNF 


[13] 
(13a ] 
[13b] 

[14] 
[14a] 
[14b ] 


variances on the right are correlated. 
Usually within and between mean 
squares are obviously uncorrelated for 
any one particular analysis. In this 
case, since they are quite different 
groups of subjects (Ss), it might be 
argued that the chances of systematic 
correlation are still less. However, 
the question is actually a very com- 
plex one, to be answered only by 
study of the structure of the data in 
For example, in some 
instances of sibling pairs, they are not 
random subgroups from the larger 
group. The concensus of expert sta- 
tistical opinion consulted nevertheless 
is that, in terms of possible correlation 
over a series of samples, we are justi- 
fied in treating these as uncorrelated 
variances. Accordingly the variance 
of the abstract variance estimate is a 


each case. 
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simple sum of the independent func- 
tions of the experimentally obtained 
variance estimates on the right of our 
equations, i.e., 


e 2>k;70;' 3° 
~” = a= I [3] 
where 7 is any subgroup. 
As stated above, it is likely in most 
actual experiments, unless one is pre- 
pared to throw away hard won data 
that the numbers in the various ex- 
perimentally obtained family constel- 
lation subgroups (sibs, twins, etc.) 
will differ. If the successive groups in 
Equation 1 have m, m2, etc. cases the 
above estimate of the variance of the 
variance is stated more completely as: 


2k;70;4 
nm,—1 
2k? 


m2 — 1 


Cea? = 


- 


However, if ” is small—less than 30 
cases (unlikely in these experiments) 

the degrees of freedom which give 
an unbiased estimate become n, + 1, 
etc., according to Daniels (1939) and 
Welch (1956). 

Thus the confidence limits for a 
given abstract variance can be desig- 
nated as: 


V2Dko? : 
a —— <6? <o," 
n;—1 


Pr |ox-Z, 


LZ V2Sk.02 _p a 
TS; —s ——s (S] 
where Z; is the normal standard de- 
viate for a given probability, P;; ¢,? 
the “‘true’’ value for the population 
variance, and we assume that, through 
large m or the use of an unbiased esti- 
mate, the distribution of the variance 
is normal. 

Finally, to determine the confidence 
limits of heredity-environment vari- 
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—— Tor” 
ance ratios, e.g., a 
The 


or, in general 


Ca" , , - 
52 We simplify the probability state- 


ments for numerator and denominator 
as follows: 


Pr[A < 62 < B] =P, 
Pr[C <62 <D]=P; 


[ 6a | 
[6b } 


The combined probability for the 
ratio is not strictly the product P,P, 
because P,; and P: are not entirely 
independent; but when using the ex- 
treme confidence interval ranges (.95 
to .99) in which we are alone interested, 
the error from assumption of inde- 
pendence is very small’ and we 
may use: 


> A ae Ga 
Pr | 4 ae’ *, 

As with any confidence limits test, 
it is not possible to decide, without 
actual empirical variance data carry- 
ing experimental error, what the de- 
sirable number of cases should be in a 
well planned experiment. However, 
with such a major experimental 
undertaking as is involved in a really 
well designed MAVA experiment, to 
determine nature—nurture ratios for 
some particular primary personality 
source trait, obtaining the earliest 
possible indication from experience 
as to what is adequate is vital. For- 
tunately, a lead can now be obtained 
by applying the above formulae to the 
sample sizes achieved in the first 
investigations of inheritance of pri- 
mary personality factors by Cattell, 
Blewett, and Beloff (1955) and Cattell, 
Kristy, and Stice (1957). These 
studies measured a total of 1,024 
eleven- and twelve-year old children 


B , 
< 2 | = P,P; [7] 


7 For example with both P; and P; equal 
to .90 and assuming independence we get 
P,P, = .81, whereas assuming lack of inde- 
pendence this becomes .80. 
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on all personality factors established 
up to that time, using, first, question- 
naire measures (the High School 
Personality Questionnaire) and, sec- 
ondly, objective tests (the Children’s 
Objective-Analytic Battery). Taking 
for illustration first a factor from the 
questionaire, and one with the highest 
of hereditary determinations (next to 
intelligence), we choose H factor, 


parmia vs. threctia, a temperament 
factor believed associated with para- 
sympathetic system 


predominance. 


an Two om 
The value . came out at = 52. 


7 
Cwh 1.39 
making heredity about twice as im- 
portant, within the family as en- 
vironment. According to our present 
formulae above the P = .05 limits 
are approximately .18 and 1.39. 
Taking as different a case as possi- 
ble for our second illustration we turn 
to a predominantly environmentally 
determined factor and one measured 
in objective tests. Let us examine the 
between family variance. For this 
we chose UI 23 (Cattell, 1957) which 
Eysenck has called the general neu- 
roticism factor and which we call 


Cr 
value of - 
Thr” 


the P = .05 


ergic regression. The 
came out at 6.7, and 
limits by the above formulae stand 
at 1.9 and 32.0. All values make 
environment important than 
heredity, but the range in this case 
is very large. 


more 


It is well known that as one extends 
the number of linear combinations 
from which estimations are made the 
magnitude of the standard error in- 
creases in the disheartening fashion 
seen in the above results. However, 
the conclusion to be drawn, 
knowledge of the relative environ- 
mental and hereditary contributions 
on continuous personality and ability 
dimensions is of great theoretical and 
practical importance (and since the 


since 
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twin method has similarly large sam- 
pling errors and is additionally, fun- 
damentally amiss in its assumptions !) 
is surely that research must foot the 
bill. To say that the method is self- 
defeating because it requires 10,000 
rather than 1,000 cases is to use an 
argument which, applied to the an- 
alogous wearisome organization re- 
quired in separating radium, would 
have brought the work of the Curies 
to a standstill, or, in our own genera- 
tion, would have blocked the way to 
atomic energy. Those who believe 
that advance in the human sciences 
is as important as in the physical 
sciences will not blink at the expendi- 
ture which, though large compared 
with a rat experiment, is still a 
hundredth of that in several current 
physical science problems. At present 
(unless one pursues the mirage of 
analogy from animal experimenta- 
tion) the MAVA method, despite its 
experimental-organizational arduous- 
ness, is the only way either to check 
basic theories or, what is more im- 
portant, generate more exact and 
potent hypotheses, as discussed be- 
low. Nevertheless, it is presented 
here as unfinished in development, 
and requiring improvements, notably 
in refinement of the estimation of 
standard the the 
attenuation correction, and in regard 
to the assumption of noncorrelation 
in Equation 7, etc. These issues 
would require separate papers in 
themselves. 

While no space can be given here 
to heading off beforehand the numer- 
ous misunderstandings possible on so 


errors, in use of 


complex a topic, we would point out 
that perusal of the discussion of as- 
sumptions in their psychological con- 
text in the earlier articles (Cattell, 
1953; Cattell, Blewett, & Beloff, 1955; 
Cattell, Kristy, & Stice, 1957) will 
make the present statements of for- 
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mulae, etc., more meaningful to the 
psychologist. 


PSYCHOLOGICAL THEORY IMPLICIT 
IN THE MAVA MOobDEL 


What has been propounded above 
is not a theory, in the sense of most 
discussed in this Review, but a model 
Burt and Howard (1957), and a few 
others on intelligence, and of Cattell, 
Beloff, and Blewett (1955), and Cat- 
tell, Kristy, and Stice (1957) on per- 
sonality, nothing existed to restrict 
and a method, by which a consider- 
able array of theories can be expressed 
and tested. In this area the vital need 
of psychology has been the initial 
appearance of some dependable rela- 
tionships out of which a_ theory 
worthy of that name, rather than 
mere speculation, can be born. Apart 
from the work of Burks (1928), 
Burt and Howard (1957), and a few 


others on intelligence, and of Cattell, 
Beloff, and Blewett (1955), and Cat- 
tell, Kristy, and Stice (1957) on per- 
sonality, nothing existed to restrict 
speculators from ascribing anything 
from 0.1% to 99.9% of the variance 


in any human trait to whichever 
influence their prejudices favored. 

The restrictions which the present 
model puts upon theories (additive 
hereditary and environmental effects, 
interaction of heredity and environ- 
ment by correlation and covariance 
but not statistical interaction) are 
slight compared to the aid it gives in 
checking many theories. Among the 
theories deserving experimental in- 
vestigation are: 

1. The position adopted in our own 
investigations that the variance ratio 
for any factorially composite piece of 
behavior will be middling but that 
more clear-cut predominance of hered- 
ity or environment will be found for 
factor-pure source traits (Cattell, 
1946). Although it is theoretically 
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possible, as mooted by Thomson and 
the present writer (1946) that a 
genetic factor (gene) may appear as 
a statistical factor it is almost certain 
that constitutional factors among the 
psychologists factors will be poly- 
genic. Nevertheless there is a tend- 
ency for factors to be predominantly 
either environmental or hereditary 
(Cattell, Kristy, & Stice, 1957). 

2. The theory that the occurrence 
of environmental influences at par- 
ticular (early) ages, as in Freudian 
theory or in imprinting (Hess, 1959), 
is more effective than at others. In 
animals this theory can be tested by 
manipulative experiment, but in hu- 
mans only by some form of the 
MAVA method applied to groups with 
different histories. In this connection 
an editorial critic has asked, ‘“‘How 
can one possibly separate maturation 
from learning, age decline, etc., when 
these operate differently at different 
The answer is that it is pre- 
cisely through, and only through, 
first obtaining these ratios for strategi- 
cally chosen age levels, cultures, etc., 
that further inferences can be made 
regarding the periods of life, or the 
subcultural influences, determining 
the relative action of maturation and 
learning. For example, it was only 
through the discovery of the predomi- 
nantly negative value of the heredity 
environment correlations, from the 
above experiments, that the “law of 
coercion to the bio-social mean’”’ (see 
6 below) became justified as a worth- 
while proposition regarding heredity 
environment interaction. 

3. The theory that, contrary to 
popular thinking, the ratio of heredi- 
tary influence will be higher with older 
Ss. This assumes that younger chil- 
dren will have been exposed to a 
smaller sample of the total environ- 
ment and will experience in this ex- 
posure the greater variance of means 
characteristic of smaller samples. The 


ages ?”’ 
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remaining un- 

nurture 

greater 
the 


variance 
the 
therefore be 


hereditary 
ratio 
when 


MAVA 


changed, nature 
should 
older Ss are used in 
experiment. 

4. A number of interesting theories 
are frequently entertained about the 
effects of family positions and con- 
stellations, of age differences between 
sibs and between children and parents, 
upon personality traits. Such theories 
can be systematically investigated by 
determining the relative environ- 
mental contributions for strategically 
chosen groups of Ss in repeated 
MAVA experiments. 

5. The importance of a 
“family 


general 


atmosphere,” e.g., in the 


genesis of neurosis, can be checked 


the comparison of within- 
and between-family environ- 
variance on the MAVA ap- 


measures of neuroticism 


through 
family 
mental 
plied to 
factors. 
6. A variety of theories needs to be 
tested in regard to the causal direc- 
tions in discovered correlations of 
heredity and 
positive correlation, for intelligence, 
mean that brighter children create a 
richer environment or that for social 
reasons they tend to be born into one? 
This can be answered partly by the 
comparison Of Ties, aNd wesw. Does 
the large correlation of 
heredity and environment of the ergic 


environment. Does a 


negative 


regression factor, UI 23, (Eysenck’'s 
‘neuroticism factor’’) require a theory 
that the avoid the 
stresses that might precipitate neu- 
rosis? The general finding that 
hereditary and environmental influ- 
ences tend to be negatively correlated 
with the more substantially inherited 
traits has already been the source of a 
theory (Cattell, 1957) of ‘cultural 
coercion to the bio-social norm.” 

7. A variety of theories in cultural 
and social psychology 


neurosis-prone 


anthropology 
can now be examined through a com- 
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parison of the MAVA experimental 
results i different countries. For ex- 
ample, a democratic organization of 
society, providing opportunity for 
all, should raise the importance of 
heredity relative to environment in 
determining, for example, perform- 
ance on intelligence tests. 

8. The model also permits examina- 
tion of theories regarding the effect of 
one trait upon another. A constitu- 
tional endowment of high  schizo- 
thymia (A factor), for example, is to 
be considered part of the environment 
for another trait in the same indi- 
vidual, e.g., high intelligence. <A 
comparison of MAVA findings on 
intelligence ratios for two types of 
Ss—those preselected to be high and 
low, respectively, in A factor—can test 
and shape initial theories. 

9. For the , ersonality psychologist 
the important theories of all 
are those concerning the hereditary 
or environmental origins of particular 
personality factors. For example, 
the finding that the H primary in the 
second-order extraversion factor (Cat- 
tell & Scheier, 1960) is largely heredi- 
tarily determined whereas F, sur- 
gency-desurgency, is equally a com- 
plete product of environmental learn- 
ing situations, throws considerable 
new light on extraversion. Also, at 
one stroke, it cuts the theories to 
be investigated regarding H to largely 
physiological ones, and those regard- 


most 


ing surgency to various possibilities 
in learning theory. <A lot of theo- 
retical wild goose chases are avoided. 
Equally theoretically provocative is 
the preliminary finding that Comen- 
tion, UI 20, which some have called 
a superego pattern, is actually heavily 
hereditarily determined. 

The above condensed survey may 
suffice to show that the role of MAVA 
findings in both forming and testing 
theoretical developments is consider- 
able, especially in view of the pro- 
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longed dearth of evidence in this field 


and the previous impossibility of 
formulating results in terms of within- 
and between-family hereditary and 
environmental effects, and the corre- 
lations among them. 
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MOTIVATION AND CUE UTILIZATION IN INTENTIONAL 
AND INCIDENTAL LEARNING’ 


DONALD H. KAUSLER 2 anno E. PHILIP TRAPP 


University of Arkansas 


In a recent theoretical article, Easter- 
brook (1959) has attempted to relate 
level of motivation to the utilization of 
irrelevant cues in a wide range of be- 
havioral situations. One of these situa- 
tions pertains to the familiar incidental 
learning paradigm in which subject (S$) 
is simultaneously exposed to relevant 
and irrelevant cues. Since this proce- 
dure provides no orienting instructions 
to ensure exposure to the irrelevant 
cues, cognizance of irrelevant cues 
wo. .d appear to be dependent upon the 
range or span of perception operating 
during performance on the intentional 
task (relevant cues). In general, as 
the range of perception extends beyond 
the central relevant cues, more periph- 
eral cues are utilized, thus permitting 
more irrelevant or incidental cue learn- 
ing ; conversely, as the perceptual range 
constricts, fewer peripheral cues are 
utilized and incidental cue learning is 
reduced. According to Easterbrook, 
drive increments produce a restriction 
in range of cue use. This restriction, 
he clearly points out, may facilitate or 
impair the learning of relevant cues, 
depending upon the complexity of these 
cues, but will always impair the learn- 
ing of peripheral irrelevant cues. As 
noted by Easterbrook, a similar posi 
tion has been taken by Bruner, Matter, 
and Papanek (1955) relating what they 
call “breadth” of learning to both over- 
motivation and overlearning. 

The present article is concerned fur- 
ther with motivation as a variable in 

1 This article 


grant from the National Science Foundation. 
2 Now at St. Louis University. 
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incidental learning. More specifically, 
it is intended to be an evaluation and 
an extension of the position taken by 
Easterbrook that increasing drive level 
during intentional learning reduces the 
efficiency of incidental learning through 
the resulting decrease in irrelevant cue 
utilization. 

However, before proceeding into the 
crux of the problem, some comments 
will be made on methodological pro- 
cedures in incidental-learning experi- 
ments which should help serve to define 
the scope and limitations of the pro- 
posed extension. 


METHODOLOGIES IN INCIDENTAL 
LEARNING EXPERIMENTS 


In experimental studies investigating 
the principles of learning as they apply 
to incidental versus intentional learning, 
two basic methodologies have evolved. 
The first methodology requires a group 
of Ss to learn a specified task inciden- © 
tally and, usually, a second group of Ss 
to learn the same task intentionally. 
That is, the intentional group learns the 
task with specific instructions to learn; 
the incidental group, on the other hand, 
is directed toward the task by some 
orienting instructions but is not in- 
structed to learn the task. The impor- 
tant and critical condition in this meth- 
odology is that Ss in the incidental 
group are given orienting instructions 
that assure perception of the learning 
material without an overt set to learn. 
Learning of the incidental material is 
then assumed to indicate the arousal 
of a covert or generalized set to learn. 
This methodology has been the pre- 
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dominant one employed in the large 
number of recent investigations of in- 
cidental learning (e.g., Brown, 1954; 
Postman & Adams, 1956a, 1956b, 1957, 
1958; Postman, Adams, & Bohm, 1956; 
Postman, Adams, & Phillips, 1955; 
Postman & Phillips, 1954; Saltzman, 
1953, 1956, 1957). Studies in this 
methodological framework typically ma- 
nipulate systematically either task, pro- 
cedural, or orienting variables in an 
attempt to clarify the nature of the 
processes involved in incidental learn- 
ing. Rarely, however, is motivation 
systematically manipulated within this 
framework. An important exception is 
a study by Spielberger, Goodstein, and 
Dahlstrom (1958) in which level of 


anxiety was investigated as a variable 


in incidental learning. In this study, 
however, only an incidental group was 
used without a corresponding group 
given instructions to learn. 

In the second methodology, each 
group of Ss employed in the experi- 
ment learns both an intentional task 
with instructions to learn and a differ- 
ent incidental task without instructions 
to learn. In other words, in this de- 
sign, S serves as his own control for 
both intentional and incidental learning. 
This methodology lends itself more 
readily to investigating the effects of 
motivational variables on both inten- 
tional and incidental learning than does 
the first methodology. It is apparent 
that the restricting influence of motiva 
tion formulated by Easterbrook is ap 
plicable mainly to this methodology. In 
fact, four of the five studies cited by 
Easterbrook (Bahrick, 1954; Kohn, 
1954; Silverman, 1954; Silverman & 
slitz, 1956) as providing evidence for 
the effects of motivational concentra- 
tion on irrelevant cue utilization in 
human learning are within this frame- 
work. The discussion that follows will 
therefore be restricted primarily to this 
methodology. 
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THE NATURE OF MOTIVATION AS A 
VARIABLE IN CUE UTILIZATION 


In his treatment of motivation as a 
variable in cue utilization, Easterbrook 
(1959) is referring specifically to emo- 
tionally based drives. He states: 


it is an empirically derived generalization 
that, when the direction of behavior is con- 
stant, increase in drive is associated with a 
reduction in the range of cue use. This paper 
is devoted to the expansion and exploration 
of this generalization (pp. 183-184) 


He further states: 


The term drive . .. refers to a dimension 


of emotional arousal or general covert ex- 


citement, the innate response to a state of 
biological deprivation or noxious stimulation, 
which underlies or occurs simultaneously 
with overt action and affects its strength and 
course. This emotional arousal is greater in 
neurotic than in normal subjects, greater 
than usual in subjects under stress or threat 
or in frustration, and in general greater in 
animals that have been “motivated” by any 
of the usual deprivations, noxious stimula- 
tions, or other incentives than it is in unmo- 
tivated or resting animals of the same species 
(p. 184). 


From these statements it seems ap- 
parent that Easterbrook is attempting 
to relate generalized drive (D), as con- 
ceptualized by the Iowa group, to cue 
utilization. 

One of the studies cited by Easter- 
brook as evidence for motivational re- 
striction, in human learning, however, 
clearly does not attempt to manipulate 
generalized D. This is the study by 
Bahrick (1954) in which motivation is 
heightened by offering financial re- 
wards as an incentive in the high mo- 
tivational condition. Moreover, in re- 
lating motivation to perceptual-motor 
performance, Easterbrook considers in 
detail a study by Bahrick, Fitts, and 
Rankin (1952) in which motivation 
was again manipulated by financial in- 
centives. This tendency on Easter- 
brook’s part to equate motivational 
states aroused by specific sets with gen- 
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eralized D seems open to serious crit- 
icism. It is on this point that the 
thinking of the present writers begins 
to differ. 

That the nature of the motivation 
operating during performance on the 
intentional task may be an important 
determiner of irrelevant cue utilization 
is evident from a recent study by 
Kausler, Trapp, and Brewer (1959) in 
which they duplicated the intentional 
and incidental tasks Bahrick 
(1954) with groups motivated by a 
generalized D, anxiety. In the Bahrick 
situation, motivation was clearly manip- 


used by 


ulated by an incentive-oriented set. He 
found that a 
learned a serial list of geometric forms 


high incentive group 


(relevant cues) more efficiently than a 


low incentive group. However, the 


high incentive group displayed signifi- 


less incidental learning (irrel- 
evant than the incentive 
group in the learning of colors filling in 
the geometric forms. These results are 
interpreted by Easterbrook as indicat- 


cantly 


cues ) low 


ing a restriction or funneling of cue 
high motivational 
The study of Kausler, Trapp, 
and Brewer (1959), however, yielded 
conflicting results. In this study the 
drive condition was generalized emo- 
tional D rather than incentive-oriented 
set. Two separate experiments were 
performed. In the first, anxiety was 
defined by score on the Taylor MAS; 
in the second, anxiety was induced by 


utilization in the 
gre up. 


intense ego involving instructions. In 
both experiments, as in the Bahrick 
study, the high drive group did signifi 
cantly better on the intentional task 
than the low drive group, but, unlike 
the Bahrick study, no difference was 
found between the two groups on the 
incidental task. 

In comparing the Bahrick (1954) 
and Kausler et al. (1959) studies, the 
differential feature nature 
of the motivation emploved, incentive- 


was in the 
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oriented set versus generalized D. 
Farber (1955) has emphasized that 
nongeneralized motives have directive 
aspects but do not have dynamogenic 
properties. He states: 

The supposition that motivational variables 
have steering properties as 
well as energizing and reinforcing (drive) 
properties does not, of course, imply that the 
two kinds of properties are identical. Indeed, 
the virtue of the foregoing analyses of the 
concept of motivation lies in their separation 
of these functions. The associative function 
of a motive or any other variable is identified 
in terms of its tendency to elicit or facilitate 
a limited class of responses only. The drive 
function of a variable is demonstrated if: 
(a) its presence energizes or intensifies in- 
discriminately all reaction tendencies existing 
in a given situation; and/or (b) its elimina- 
tion or reduction in magnitude is reinforc- 
ing, i.e., leads to the increased probability of 
recurrence, in the same situation, of the re- 
sponses preceding its modification. Motives, 
it is presumed, have both functions, and if a 
variable clearly does not have both, its status 
as a motive is questionable (pp. 311-312). 


(associative ) 


Thus, incentive-oriented sets would 
seem to possess associative properties 
without drive properties. That is, they 
elicit particular responses that are goal 
directed but do not energize other re- 
sponses that are irrelevant to the goal. 
In general, sets of this nature are ex- 
perimentally induced by encouraging 
efficient performance on the relevant 
task without intentionally arousing emo- 
tional responses. This is typically ac- 
complished by means of rewards or by 
task-oriented instructions. Generalized 
D, on the other hand, serves an energiz- 
ing function in that its presence intensi- 
fies indiscriminately all reaction ten- 
dencies existing in a given situation, 
both relevant and irrelevant to the cen- 
tral task. In general, D is experi- 
mentally manipulated by the intentional 
arousal of emotional responses during 
the learning process. This is typically 
accomplished by differentiating between 
Ss on some measure of intrinsic anxiety 
or by inducing anxiety through the use 


a 
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of ego involving instructions, failure, 
stress, etc. 

It is conceivable that in both the 
Bahrick and Kausler et al. studies in- 
crements in motivation, either associa- 
tive or drive, resulted in a reduction of 
habit strength for the irrelevant cues 
through the restriction of perceptual 
range proposed by Easterbrook. Thus, 
the high motivational groups acquired 
less irrelevant habit strength than the 
low motivational group. In the Bah- 
rick study this difference in habit 
strength is manifested because the dif- 
ference in motivation between the two 
groups is relevant only to the inten 
tional task and, therefore, does not 
enter into the multiplicative relationship 
(cf. Spence, 1958a, 1958b) between 
irrelevant habit strength and drive level. 
In the Kausler et al. study the gen- 
eralized D would be expected to inten- 
sify the relatively weaker habit strength 
in the high drive group, thus bringing 


its response pqtential to a point com- 
parable to that in the low drive group 
where habit strength is stronger but 


drive level is weaker. Hence, the 
groups would not differ in incidental 
learning. 


POSITIONING OF RELEVANT AND 
IRRELEVANT CUES 


The results of the Kausler et al. 
study also conflict with the studies by 
Silverman (1954) and Silverman and 
Blitz (1956) cited by Easterbrook as 
yielding supporting evidence for the 
restrictive influence of high motivation. 
In all three studies motivation was 
manipulated by an emotionally based 
generalized D. In the Silverman study 
anxiety was induced by threat of shock, 
and in the Kausler et al. study it was 
induced by ego involving instructions. 
In both the Silverman and Blitz and 
Kausler et al. studies intrinsic anxiety 
was employed by selecting extreme 
scorers on the MAS. 


AND E. Puitip TRAPP 


In both Silverman studies an increase 
in anxiety level was followed by the 
expected decrease in incidental learn- 
ing in constrast to the no difference re- 
ported in the Kausler et al. study. Al- 
though the drive state does not differ in 
the three studies, there is another im- 
portant differential between them. This 
is in terms of the relationship of the 
irrelevant cues to the relevant cues 
within the perceptual field. In the 
Kausler et al. study, as in the Bahrick 
study, the irrelevant cues are embodied 
within the relevant cues (colors within 
geometric forms), thus assuring close 
proximity, both spatially and tempo- 
rally, between relevant and irrelevant 
cues. Consequently, even though anx- 
ious Ss may have a restricted percep- 
tual range, the irrelevant cues may well 
fall within that range. In contrast, the 
relevant and irrelevant cues in the Sil- 
verman studies are widely spaced within 
the perceptual field. In fact, in the 
Silverman (1954) study, the relevant 
cues are perceived by a different sense 
modality (vision) than are the irrel- 
evant cues (audition). In the Silver- 
man and Blitz (1956) study, the irrele- 
vant cues (two-digit numbers) occupied 
a different section of a memory drum 
opening than did the relevant cues 
(nonsense syllables), and were sepa- 
rated from the relevant cues by six 
centimeters. It is very possible, there- 
fore, that in the Kausler et al. study the 
differential perception between high 
and low anxious groups was not as 
pronounced as in the Silverman studies. 

It is conceivable then that the amount 
of incidental learning under high and 
low motivation is affected differentially 
by position of the irrelevant cues within 
the perceptual field. That is, there is 
an interaction effect between level of 
motivation and cue positioning. In 
general, as the irrelevant cues are 
moved toward the periphery of the per- 
ceptual field, incidental learning will de- 
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crease, but the reduction will be greater 
for the high motivational levels than for 
Considerable systematic investi- 
gation is needed to establish the validity 
of this proposed hypothesis. 


low. 


TASK AND PROCEDURAL VARIABLES 


Other important variables to take 
into account in evaluating the effects 
of increased motivation on intentional 
and incidental learning are variables re- 
lating to the nature of the intentional 
task and method of presenting the in- 
tentional task. Particularly important 
variables would seem to be variables re- 
lating to the difficulty of the intentional 
task, such as, length, intratask com- 
petition, rate of presentation, etc. A 
number of studies employing the first 
methodology discussed earlier have sys- 
tematically investigated these variables 
and demonstrated their importance in 
incidental learning studies (e.g., Nei- 
mark & Saltzman, 1953; Postman & 
Adams, 1957; Postman, Adams, & 
Phillips, 1955). 

Differences in the tasks used, both 
intentional and incidental, make it more 
arduous to compare the results of stud- 
ies with the simultaneous relevant-irrel- 
evant cue methodology. This is espe- 
cially true when motivation is varied 
qualitatively from study to study. 
Kausler and Trapp (1959) have sug- 
gested that studies making cross-drive 
comparisons should use, whenever pos- 
sible, the same task in order to control 
for task effects. 

Some evidence that the efficiency of 
incidental learning under different lev- 
els of motivation may be related to task 
difficulty is indicated in the study of 
Spielberger, Goodstein, and Dahlstrom 
(1958). Using the first methodology 
but without an intentional group, they 
found that the relationship between 
anxiety level (as defined by score on 
the MAS) and _ incidental learning 
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(learning the designs on the Bender- 
Gestalt test) was a function of the 
difficulty of the learning material. On 
the easy designs the high anxious Ss 
performed at a higher level than the 
low anxious Ss; on the difficult designs 
the low anxious Ss performed at a 
higher level than the high anxious Ss. 

The present writers believe that task 
difficulty variables operate in the inten- 
tional-incidental learning situation by 
altering the degree of concentration re- 
quired of Ss. Rate of presentation, for 
example, would be important in that 
the faster the rate, the greater the de- 
gree of concentration demanded. It is 
here suggested that intentional task var- 
iables which alter the degree of concen- 
tration required of S will affect further 
the range of attention for simultane- 
ously present irrelevant ones. How- 
ever, the net effect on the learning of 
irrelevant cue habits should also be a 
function of the nature of the motivation 
present (incentive-oriented versus gen- 
eralized D), the level of motivation, and 
the positioning of irrelevant cues within 
the perceptual field. Thus studies re- 
lating degree of motivation to irrelevant 
cue utilization should also take into ac- 
count the nature of the drive, the nature 
of the intentional task, and the position- 
ing of the irrelevant cues. 

For example, on easy tasks which 
require little concertraticn, high gen- 
eralized drive Ss snur.! display more 
incidental learning than iow drive Ss 
(as in the Spielberger et al., 1958, 
study ). The perceptual range should be 
fairly broad under these conditions and 
should incorporate irrelevant cues within 
close proximity to the relevant cues. 
Since generalized D reacts multiplica- 
tively with irrelevant habit strength, the 
reaction tendency for high drive Ss 
should be greater than for low drive Ss. 
On difficult tasks, however, the percep- 
tual range for irrelevant cues should be 
much narrower. Consequently, when 
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combined with the further restricting 
effect of high drive, the attention for 
these cues and subsequent habit strength 
should be considerably lower for high 
drive Ss than for low drive Ss. Under 
these conditions, low drive Ss should 
to display superior incidental! 
learning. 

With incentive-oriented motivation, 
however, low motivation might be ex- 
pected to result in superior incidental 
learning regardless of intentional task 
difficulty. This is likely because the 
motivation involved does not enter into 
a multiplicative relationship with habit 
strength. Nevertheless, the difference 
in incidental learning proficiency be- 
tween high and low incentive groups 
could be expected to diminish as the 
intentional task decreases in difficulty. 
That is, the highly motivated Ss would 
need to concentrate less on easy tasks, 
thus reducing the funneling effect on 
their attention to irrelevant cues; con- 
versely, on difficult tasks requiring in- 
tense concentration, the funneling effect 
of high incentive motivation should be 
most pronounced. 


begin 


It is obvious that considerable sys- 
tematic research is needed, manipulat- 
ing task variables along with motiva- 
tional and positional variables, before 
the overall effects of level of motivation 
on irrelevant cue utilization can be de 
terminéd. It does seem clear, however, 
that the phenomena Easterbrook seeks 
to embrace are too complex to be uni- 
fied under one general principle, appeal- 
ing as the thought may be. Even at 
this stage, the empirical data demand 


some extensions to his formulations. 


The elaborations presented here do 


reconcile existing conflicting findings. 
How well they can handle derivable 
hypotheses, like those suggested in the 
foregoing, remains to be seen. Studies 
to that effect are currently in process in 
our laboratories. 


Puitie TRAPP 


SUMMARY 


Kasterbrook has proposed that in- 
creasing motivation (specifically defined 
as emotional D) results in a funneling 
or focusing of attention to relevant cues 
when Ss are simultaneously exposed to 
relevant and irrelevant cues. The pres- 
ent article suggests that the net effects 
of increasing motivation on cue utiliza- 
tion during incidental learning are func- 
tions of additional variables as well. Of 
particular importance seem to be the 
nature of the motivation (generalized 
D versus incentive-oriented set), posi- 
tion of the irrelevant cues relative to the 
relevant cues (central versus periph- 
eral), and difficulty of the intentional 
task. This extension of Easterbrook’s 
framework brings on a rapprochement 
with findings and 
opens the way to more systematic in- 
vestigations into an admittedly complex 
area. 


existing research 


REFERENCES 

Baurick, H. P 
two incentive conditions 
1954, 47, 170-172 

Banrick, H. P., Fitts, P. M., & RANKIN, 
R. E. Effects of incentives upon reactions 
to peripheral stimuli. J. erp. Psychol. 
1952, 44, 400-406. 

Brown, G. H. Factors influencing incidental 
learning. J. exp. Psychol., 1954, 47, 163- 
169 

Bruner, J. S., Matter, JEAN, & PAPANEK, 
MirIAM 3readth of learning as a func- 
tion of drive level and mechanization. 
Psychol. Rev., 1955, 62, 1-10. 

EAsTerBrook, J. A. The effect of emotion 
on cue utilization and the organization of 


Psychol. Rev., 1959, 66, 183 


Incidental learning under 
J. exp Ps) hol 


behavior. 
201 

FArper, I. FE. The role of 
verbal learning and performance. 
chol. Bull., 1955, 52, 311-327. 

Kauster, D. H., & Trapp, E. P. Methodo- 
logical considerations in the construct val- 
idation of drive-oriented scales. Psychol. 
Bull., 1959, 56, 152-157. 

Kauser, D. H., Trapp, E. P., & Brewer, 
C. L. Intentional and incidental learning 
under high and low emotional drive levels 
J. exp. Psychol., 1959, 58, 452-455. 


motivation in 
Psy - 





MOTIVATION 


Koun, H. 
»f experimentally induced stress situations 
upon certain aspects of perception and per- 
formance. J Psychol., 1954, 85, 
289-304 

NerMaArRK, Epitu, & SALttzMAN, I. J. Inten 
tional and incidental learning with differ- 
ent rates of stimulus presentation. 

Psychol., 1953, 66, 618-621. 

PostMAN, L., & ApAMs, PAULINE, A 
ies in incidental learning: III 
interference. J. exp. Psychol., 
323-328. (a) , 

PostMAN, L., & ApAmMs, PAULINE A. 
ies in incidental learning: 1V. The inter- 
action of orienting tasks and stimulus 
materials. J. exp. Psychol., 1956, $1, 329 
333. (b) 

PostMAN, L., & ApAMs, PAULINE A. 
ies in incidental learning: VI. 
interference. J. exp. Psychol., 
153-167 


STMAN, I 


Effects of variations of intensity 


genet, 


{ -Y 
simer. 


Stud 
Interserial 


1956, 51, 


Stud 


Stud- 
Intraserial 
1957, 54, 


, & Apams, Pautine A._ Stud 

idental learning: VII. Effects of 
frequency of exercise and length of list. 
J. exp. Psychol., 1958, 56, 86-93. 

PostMAN, L., ADAMs, A., & Bou 
A. M. Studies in incidental learning: V 
Recall for order and associative clustering 
I. exp. Psychol., 1956, §1, 334-342 

PostMAN, L., Apams, Pau.ine A., & Put- 
Lips, Laura W Studies in 
learning: II. The effects of association 
value and the method of testing. J. exp. 
Psychol., 1955, 49, 1-10 


es in in 


PAULINE 


incidental 


AND CUE UTILIZATION IN 


INCIDENTAL LEARNING 379 

PostMAN, L., & Puitiips, Laura W. Stud- 
ies in incidental learning: I. The effects of 
crowding and isolation. J. exp. Psychol., 
1954, 48, 48-56. 

SALTZMAN, I. J. The orienting task in in- 
cidental and intentional learning. Amer. 
J. Psychol., 1953, 66, 593-597. 

SALTzMAN, I. J. Comparisons of incidental 
and intentional learning with different 
orienting tasks. Amer. J. Psychol., 1956, 
69, 274-277. 

SaitzMAN, lL. J. Incidental and intentional 
memory for lifted-weights. Amer. J. Psy- 
chol., 1957, 70, 253-257. 

SILVERMAN, R. E. Anxiety and the mode of 
response. J. abnorm. soc. Psychol., 1954, 
49, 538-542. 

SILVERMAN, R. E., & Butz, 
and two kinds of anxiety. 
Psychol., 1956, 52, 301-303. 

Spence, K. W. A theory of emotionally 
based drive (D) and its relation to per- 
formance in simple learning situations. 
Amer. Psychologist, 1958, 13, 131-141. (a) 

Spence, K. W. Behavior theory and selec- 
tive learning. In M. R. Jones (Ed.), 
Vebraska symposium on motivation. Lin- 
coln: Univer. Nebraska Press, 1958. Pp. 
73-107. (b) 

SpreLBERGER, C. D., Goopstein, L. D., & 
Dau strom, W. G. Complex incidental 
learning as a function of anxiety and task 
difficulty. J. exp. Psychol., 1958, 56, 58 
6l. 


B. Learning 
J. abnorm. soc. 


(Received September 29, 1959) 





Psychological Review 
1960, Vol. 67, No. 6, 380-400 


BLIND VARIATION AND SELECTIVE RETENTION 
IN CREATIVE THOUGHT AS IN OTHER 
KNOWLEDGE PROCESSES * 


DONALD T. CAMPBELL 


Northwestern University 


This paper proposes to examine 
creative thought within the framework 
of a comparative psychology of knowl- 
edge processes, and in particular with 
regard to one theme recurrent in most 
knowledge processes. This theme may 
be expressed as follows : 


1. A blind-variation-and-selective-retention 
process is fundamental to all inductive 
achievements, to all genuine increases in 
knowledge, to all increases in fit of system 
to environment. 

2. The many processes which shortcut a 
more full blind-variation-and-selective-reten- 
tion process are in themselves inductive 
achievements, containing wisdom about the 
environment achieved originally by blind 
variation and selective retention. 

3. In addition, such shortcut 
contain in their own operation a blind- 
variation-and-selective-retention process at 
some level, substituting for overt locomotor 
exploration or the life-and-death winnowing 
of organic evolution. 


processes 


experimental 
physicist and some virus-type ancestor 
there has been a tremendous gain in 


Between a modern 


knowledge? about the environment. 

1A partially overlapping version of this 
paper was presented at the Inter-Discipli- 
nary Conference on Self-Organizing Systems, 
sponsored by the Office of Naval Research 
and the Armour Research Foundation of the 
Illinois Institute of Technology, Chicago, 
May 5-6, 1959. The proceedings of the con- 
ference are to be published by Pergamon 
Press under the title Self-Organizing Sys- 
tems. The author is indebted to Carl P. 
Duncan for contributing to the development 
of many of the points involved. 

2 This extended usage of “knowledge” is 
a part of an effort to put “the problem of 
knowledge” into a behavioristic framework 
which takes full cognizance of man’s status 
as a biological product of an evolutionary 
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In bulk, this has represented cumulated 
inductive achievements, stage by stage 
expansions of ‘knowledge beyond what 
could have been deductively derived 
from what had been previously known. 
It has represented repeated “breakouts” 
from the limits of available wisdom, for 
if such expansions had represented 
only wise anticipations, they would 
have been exploiting full or partial 
knowledge already achieved. Instead, 


development from a highly limited back- 
ground, with no “direct” dispensations of 
knowledge being added at any point in the 
family tree. The bibliographical citation 
of the several sources converging on this 
approach to the problem of knowledge, and 
the discussion of its relation to traditional 
philosophical issues and to the strategy of 
science are presented elsewhere (Campbell, 
1959). Suffice it to say here that the posi- 
tion limits one to “an epistemology of the 
other one.” The “primitives” of knowledge 
can not be sought in “raw feels” or in 
“phenomenal givens,” or in any “incorrigible” 
elements. While man’s conscious knowl- 
edge processes are recognized as more com- 
plex and subtle than those of lower organ- 
isms, they are not taken as more funda- 
mental or primitive. In this perspective, 
any process providing a stored program for 
organismic adaptation in external environ- 
ments is included as a knowledge process, 
and any gain in the adequacy of such a 
program is regarded as a gain in knowledge. 
If the reader prefers, he can understand 
the paper adequately regarding the term 
“knowledge” as metaphorical when applied 
to the lower levels in the developmental 
hierarchy. But since the problem of knowl- 
edge has resisted any generally accepted 
solution when defined in terms of the con- 
scious contents of the philosopher himself, 
little seems lost and possibly something 
gained by thus extending the range of 
processes considered. 
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real gains must have been the products 
of explorations going beyond the limits 
of foresight or prescience, and in this 
sense blind. In the instances of such 
real gains, the successful explorations 
were in origin as blind as those which 
failed. The difference between the 
successful and unsuccessful was due to 
the nature of the environment en- 
countered, representing discovered wis- 
dom about that environment. 

The general model for such induc- 
tive gains is that underlying both 
trial-and-error problem solving and 
natural selection in evolution, the 
analogy between which has _ been 
noted by several persons (e.g., Ashby, 
1952; Baldwin, 1900; Pringle, 1951). 
Three conditions are necessary: a 
mechanism for introducing variation, a 
consistent selection process, and a 
mechanism for preserving and repro- 
ducing the selected variations. In what 
follows we shall look for these three 
ingredients at a varietv of levels. But 
first a comment on the use of the word 
“blind” rather than the more usual 
“random.” It seems likely that Ashby 
(1952) unnecessarily limited the gen- 
erality of his mechanism in Homeostat 
by an effort to fully represent all of 
the modern connotations of random. 
Equiprobability is not needed, and is 
definitely f lacking in the mutations 
which lay’ the variation base for organic 
evolution., Statistical independence be- 
tween oné variation and the next, while 
frequently desirable, can also be spared : 
in particular, for the generalizations 
essayed here, certain processes involv- 
ing systematic sweep scanning are 
recognized as blind, insofar as varia- 
tions are produced without prior knowl- 
edge of which ones, if any, will furnish 
a selectwo.thy encounter. An essential 
connotation of blind is that the varia- 
tions emitted be independent of the en- 
vironmental conditions of the occasion 
of their occurrence. A second important 
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connotation is that the occurrence of 
trials individually be uncorrelated with 
the solution, in that specific correct 
trials are no more likely to occur at any 
one point in a series of trials than an- 
other, nor than specific incorrect trials. 
(Insofar as observation shows this not 
to be so, the system is making use of 
already achieved knowledge, perhaps 
of a general sort. The prepotent re- 
sponses of an animal in a new puzzle 
box toward the apparent openings may 
thus represent prior general knowledge, 
transferred from previous learning or 
inherited as a product of the mutation 
and selective survival process.) A 
third essential connotation of blind is 
rejection of the notion that a variation 
subsequent to an incorrect trial is a 
“correction” of the previous trial or 
makes use of the direction of error of 
the previous one. (Insofar as mech- 
anisms do seem to operate in this 
fashion, there must be operating a 
substitute process carrying on the blind 
search at another level, feedback cir- 
cuits selecting “partially” adequate 
variations, providing information to the 
effect that “you’re getting warm,” etc. ) 


REVIEW OF THE THEME IN LOWER 
KNOWLEDGE PROCESSES 


In this perspective, the epistemo- 
logically most fundamental knowledge 
processes are embodied in those several 


inventions making possible organic 
evolution. At the already advanced 
level of cellular life, this is a “learning” 
on the part of the species by the blind 
variation and selective survival of 
mutant individuals. In terms of the 
three requirements, variation is pro- 
vided by the mutations, selection by 
the somewhat consistent or “knowable” 
vagaries of the environment, and pres- 
ervation and duplication by the com- 
plex and rigid order of chromosome 
mitosis. Bisexuality, heterozygosity, 
and meiotic cell division represent a 
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secondary invention increasing the ef- 
ficiency of the process through increas- 
ing the range of variation and the rate 
of readjustment to novel environments. 
The selection and preservation proc- 
esses remain the same. The ubiquity 
of bisexuality, its several independent 
inventions, and the multifarious elab- 
oration of the theme, all speak to its 
tremendous usefulness. 

The higher evolutionary 
ments shift a part of the 
adaptation away from a trial and error 
of whole organisms or gene pools, 
over to processes occurring within the 
Such processes are 


develop- 


locus of 


single organism. 
numerous, each being not only a device 
for obtaining knowledge, but also 
representing general wisdom about 
environmental contingencies already 
achieved through organic evolution, 
making possible more efficient achieve- 
ment of detailed local knowledge. One 
of the most primitive of these is ex- 
ploratory locomotion, described in the 


protozoa by Jennings (1906) and ac- 
cepted as a model for Homeostat by 


Ashby > (1952). Forward locomotion 
persists until blocked, at which point 
direction, of locomotion is varied blindly 
until unblocked forward locomotion is 
again pqssible. The external physical 
environment is the selection agency, the 
preservation of discovery is embodied 
in the perseveration of the unblocked 
forward movement. At this level, the 
species has “discovered” that the en- 
vironment is discontinuous, consisting 
of penetrable regions and impenetrable 
ones, and that impenetrability is to 
some extent a stable characteristic—it 
has discovered that when blocked it 
is a better strategy to try to go around 
than to wait until one can move 
through. 

Insofar as individual organisms with- 
out distance receptors (such as para- 
mecia and earthworms) can learn 
through contiguity, the species has al- 


ey 
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ready achieved the more general knowl- 
edge that there is some event-contin- 
gency stability in the environment. 
That is, in the degree to which indi- 
vidual learning is useful, there has 
been the species-level discovery of 
slower transformation processes on the 
part of relevant segments of the envi- 
ronment than of the organism. In ad- 
dition, whereas the ultimate selection 
is life or death in encounters with the 
external environment, the evolu- 
tionary stage at which learning is pos- 
sible, much of this once-external crite- 
rion has been internalized. Crude 
environmental contingencies with low 
selection ratios are now represented 
as pleasures or pains, or as reinforcers 
more generally. The selection be- 
comes much more sharp, but the con- 
tact with the environmental realities 
less direct. 

The presence of a fundamental trial- 
and-error process in individual learn- 
ing needs no elaboration or defense. 
Suffice it to say that recognition of 
such a process is found in all learning 
theories which make any pretense of 
completeness, including at least three 
of Gestalt inspiration (Campbell, 
1956a). While higher vertebrate (and 
higher cephalopod ) learning makes far 
more use of the short circuiting of 
overt trial and error by vision than is 
for by the usual learning 
theory (Campbell, 1956b), for con- 
venience here the multiplication of 
levels will be avoided by treating trial- 
and-error learning as a single process 
level. 

The next and most striking class of 
discoveries are those centering around 
echo-location and vision. Woodworth 
(1921) has emphasized the achieve- 
ment of a percept from the ingredients 
of sensation through a series of “trial- 
and-error perceptions.” Thurstone 
(1924) has interpreted perception as 
a trial and error of potential locomo- 
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tions placed in a hierarchy of trial- 


and-error processes including both 
overt trial and error and ideational 
trial and error, in a book containing 
many anticipations of cybernetic con- 
cepts. Pumphrey (1950) interprets 
the primitive sense receptor of the 
fishes called: the “‘lateral-line organ” as 
a crude echo-location device, making 
use of the reflected pulses of the fish’s 
own swimming. Griffin (1958) has 
documented in detail the use by bats 
and cave birds of sonic and supersonic 
vocalizations selectively reflected by 
obstacles of the environment. Kellogg 
(1958) has made a similar case for 
the porpoise. Here is a powerful 
substitute for blind locomotor explora- 
tion. (See Simon, 1957, p. 264, for 
an estimate of such gains.) In echo 
location a wave pulse is emitted blindly 
in all directions. The obstacles of the 
environment selectively reflect the 
pulse from certain of these directions, 
and thus provide a feedback which is 
substitutable for that which would 
have received had the animal 
locomoted in those directions. Radar 
guidance systems employ an analogous 
substitution of a blindly scanning elec- 
tromagnetic wave pulse, in economical 
substitution for a blind scanning of 
the .same environment of potential 
locomotions by full ship or projectile 


been 


movements. 

Visual perception seems interpret- 
able as a substitute search process of 
similar order (Campbell, 1956b). The 
full analogy is weakened by the absence 
of an emitting process on the part of 
the organism. Instead, advantage is 
taken of diffuse electromagnetic waves 
made available from external sources. 
Consider first a pseudoeye consisting 
of but a single photoreceptor cell 
(Such a device has been distributed 
for use by the blind in which a photo- 
cell output is transformed into a sound 
of variable pitch.) With such a device, 
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blind scanning as in a radar system is 
essential. Brightness contours can be 
located and fixated by continual cross- 
ing, as in the “hunting” process in a 
mechanical servosystem, or as in the 
vocal pitch control in which a steady 
note is “held” only by a continuous 
search oscillation (Deutsch & Clarkson, 
1959). To conceive of such an “eye” 
as a blind searching device substituting 
for a more costly blind locomotion ir 
the explored directions is not diffict:.. 
The eyes of insects and vertebrates and 
the higher cephalopods differ from 
such a device by having multiple photo- 
cells, making possible selective reflec- 


‘ tion from objects in multiple directions 


Each receptor cell can be 
pos- 


at once. 
conceived of as exploring the 
sibilities of locomotion in a given direc- 
tion, the retina collectively thus ex- 
ploring the possibilities of locomotion 
in a ‘wide segmerit of potential direc- 
tions for locomotion. Except as the 
eye is aimed by other sources of knowl- 
edge, these have been 
made “blindly” available without pre- 
science or insight. For the “blindness” 
of an eyeless animal there has been 
so efficient that 
we use it model for 
direct, unmediatéd knowing. But the 
process is still one of blind search and 
selective retention, in the em- 
ployed in this paper. 

Vision is a very complex and mar- 
velous mechanism, and the brief pres- 
entation here does not do justice even 
to the random search components in- 
volved. Hebb (1949) has well docu- 
mented the active search of eye move- 
ments, correcting the model of the 
inactive fixed-focus eye which is im- 
plicit in both Gestalt psychology and 
conditioning theory. Riggs (Riggs, 
Armington, & Ratliff, 1954) and 
Ditchburn (1955) have documented 
the essential role of the continuous low 
amplitude scanning provided by “phys- 


pt ssibilities 


substituted a process 


naively as a 


sense 
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iological nystagmus” or “fixation 
tremor.” Platt (1958) has provided a 
brilliant analysis of the role of a blind 
“rubbing” process, his “lens-grinding”’ 
model, for the achievement of visual 
acuity and spatial representation in a 
visual system containing unaddressed 
elements. These and other considera- 
tions convince the present writer that 
although vision represents the strongest 
challenge to the generality a_blind- 
variation-and-selective-retention aspect 
to all knowledge processes, it is not in 
fact an exception. These brief com- 
ments have not fully justified this con- 
clusion, however. 

Taking these echo location and visual 
exploratory processes collectively, sev- 
eral general aspects can be noted: all 
exploit a specific and limited coin- 
cidence, i.e., that objects impenetrable 
by organismic locomotion also are 
opaque to, or reflect, certain wave 
forms in the acoustical frequencies and 
in the bands of electromagnetic waves 
It is 


of the visual and radar spectra. 
this coincidence, unpredictable upon 
the basis of the prior knowledge avail- 
able to the more primitive organisms, 
which makes possible such marvelously 


efficient shortcuts. Thus while phe- 
nomenologically vision is more direct 
than other knowledge processes, it is 
seen in this perspective as an indirect, 
substitute process. As in all sub- 
stitute knowledge processes, the ef- 
fectiveness is limited by the accuracy 
of the coding process, i.e., the transla- 
tion terms between one level and 
another. Such coding is never ex- 
haustive (Platt, 1956). It always in- 
volves abstraction, and along with this 
some fringe imperfection and proneness 
to systematic error. It must finally 
be checked out and corrected by overt 
locomotion. Its efficacy is limited by 
the relevance of the coding to the more 
fundamental level of behavior for which 
it is a substitute. This relevance was 
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itself initially tested out by a_blind- 
variation-and-selective-retention proc- 
ess at the level of organic evolution or 
early childhood learning. (Species 
differ in this regard.) The phenomenal 
directness of vision tempts us to make 
vision prototypic for knowing at all 
levels, and leads to that chronic belief 
in the potential existence of direct and 
“insightful” mental processes, a belief 
which it is one purpose of this paper 
to deny. 


CREATIVE THOUGHT 


Creative thought provides the next 
level knowledge process for the present 
discussion. At this level there is a 
substitute exploration of a substitute 
representation of the environment, the 
“solution” being selected from the 
multifarious exploratory thought trials 
according to a criterion which is in 
itself substituting for an external state 
of affairs. Insofar as the three sub- 
stitutions are accurate, the solutions 
when put into overt locomotion are 
adaptive, leading to intelligent behavior 
which lacks overt blind floundering, 
and is thus a knowledge process. To 
include this process in the general plan 
of blind-variation-and-selective-reten- 
tion, it must be emphasized that insofar 
as thought achieves innovation, the 
internal emitting of thought trials one 
by one is blind, lacking prescience or 
foresight. The process as a whole of 
course provides “foresight” for the 
overt level of behavior, once the proc- 
ess has blindly stumbled into a thought 
trial that “fits” the selection crite- 
rion, accompanied by the “something 
clicked,” “Eureka,” or ‘aha-erlebnis” 
that usually marks the successful ter- 
mination of the process. 

Today, we find the blind-variation- 
and-selective-retention model most plau- 
sibly applied at the levels of organic 
evolution and trial-and-error learning 
of animals, and least palatable as a 
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description of creative thinking. His- 
torically, however, the phrase “trial 
and error” was first used to describe 
thinking, by Alexander Bain as early 
as 1855, two years before Darwin’s 
publication of the doctrine of natural 
selection. Not only for historical in- 
terest, but also to further develop the 
psychology of creativity, the following 
quotations from him (Bain, 1874) are 
provided : 


Possessing thus the material of the con- 
struction and a clear sense of the fitness or 
unfitness of each new tentative, the operator 
proceeds to ply the third requisite of con- 
structiveness—trial and error— .. . to 
attain the desired result. The number 
of trials necessary to arrive at a new 
construction is commonly so great that 
without something of an affection or fascina- 
tion for the subject one grows weary of the 
task. This is the emotional condition of 
originality of mind in any department 
(p. 593). 

In the process of Deduction . . . the 
same constructive process has often to be 
introduced. The mind being prepared be- 
forehand with the principles most likely 
for the purpose incubates in patient 
thought over the problem, trying and re- 
jecting, until at last the proper elements 
come together in the view, and fall into 
their places in a fitting combination (p. 594). 

With reference to originality in all de- 
partments, whether science, practice, or 
fine art, there is a point of character that 
deserves notice. . . I mean an Active 
turn, or a profuseness of energy, put forth 
in trials of all kinds on the chance of 
making lucky hits Nothing less than 
a fanaticism of experimentation could have 
given birth to some of our grandest practical 
combinations. The great discovery of 
Daguerre, for example, could not have 
been regularly worked out by any systematic 
and orderly research; there was no way but 
to stumble upon it. The discovery is 
unaccountable, until we learn that the 
author got deeply involved in trials 
and operations far removed from the beaten 
paths of inquiry (p. 595). 


In 1881 Paul Souriau presented a 
till more preponderant emphasis on 
the factor of chance as the sole source 


of true innovation. He asserts again 


and again that “le principe de I’inven- 
tion est le hasard.” In the main, 
he presents his argument through il- 
lustration and through the elimination 
of rival hypotheses about the inventive 
process, including deduction, induction, 
and “la methode.” A positive explana- 
tion of the process is hard to find. 
This sample will illustrate his ap- 


proach : 


A problem is posed for which we must 
invent a solution. We know the condi- 
tions to be met by the sought idea; but we 
do not know what series of ideas will lead 
us there. In other words, we know how the 
series of our thoughts must end, but not how 
it should begin. In this case it is evident 
that there is no way to begin except at 
random. Our mind takes up the first path 
that it finds open before it, perceives that 
it is a false route, retraces its steps and 
takes another direction. Perhaps it will 
arrive immediately at the sought idea, 
perhaps it will arrive very belatedly: it 
is entirely impossible to know in advance. 
In these conditions we are reduced to 
dependence upon chance. 

In the case just analysed we supposed 
that we had to solve a problem already 
stated for us. But how was the problem- 
statement itself found? It is said that a 
question well posed is half answered. If 
so, then true invention consists in the 
posing of questions. There is something 
mechanical, so to speak, in the art of find- 
ing solutions. The truly original mind jis 
that which discovers problems. But here 
again, it does no good to speak of method, 
since method is the application of already 
existing discoveries. The discovery of a 
new problem can therefore only be fortui- 
tous. Thus we see the role of logic diminish 
and that of chance increase as we approach 
closer to true invention. Chance is the 
first principle of invention: it is what has 
produced method, nourished it, and made it 
fertile. Method can only analyse the ideas 
which come to it from elsewhere, drawing 
out their consequences and exhausting their 
contents. Left to itself method soon be- 
comes sterile. Methodological minds can- 
not help having a feeling of disdain for 
adventurous minds which affirm before prov- 
ing and believe before knowing. But they 
must recognize that without such audacity, 
would be possible. The mind 
is not able to revise itself upon its own 


no progress 
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foundations. New ideas cannot have proto- 
types: their appearance can only be at- 
tributed to chance (pp. 17-18). 


Souriau has not only the notion of 
chance combinations, but also the con- 
cept of their being produced in large 
numbers which are generally worthless 
and from which only the rare ones 
fitting a goal or criterion are selected. 
These two widely separated quotations 
illustrate this: 


3y a kind of artificial selection, we can 
in addition substantially perfect our thought 
and make it more and more logical. Of 
all of the ideas which present themselves 
to our mind, we note only those which 
have some value and can be utilized in 
reasoning. For every single idea of a 
judicious and reasonable nature which of- 
fers itself to us, what hosts of frivilous 
bizarre, and absurd ideas cross our mind. 
Those persons who, upon considering the 
marvelous results at which knowledge has 
arrived, cannot imagine that the humar 
mind could achieve this by a simple fum- 
bling, do not bear in mind the great numbet 
of scholars working at the same time o1 
the same problem, and how much time even 
the smallest discovery costs them. Even 
genius has need of patience. It is after 
hours and years of meditation that the 
sought-after idea presents itself to the in- 
ventor. He does not succeed without going 
astray many times; and if he thinks him- 
self to have succeeded without effort, it is 
only because the joy of having succeeded 
has made him forget all the fatigues, all 
of the false leads, all of the agonies, with 
which he has paid for his success (p. 43). 

If his memory is strong enough to 
retain all of the amassed details, he evokes 
them in turn with such rapidity that they 
seem to appear simultaneously; he groups 
them by chance in all the possible ways; 
his ideas, thus shaken up and agitated in 
his mind, form numerous unstable ag 
gregates which destroy themselves, and 
finish up by stopping on the most simple 
and ‘solid combination (pp. 114-115). 


The phrase “artificial selection” is 


reminiscent of Darwin’s writings, al- 
though Souriau makes no mention of 
the selective-survival model of evolu- 


tion, (nor does he cite the ideas of 
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any other person whatsoever).*? Note 
how similar the final quotation is to 
Ashby’s (1952) phrasing of the inev- 
itable self-elimination of unstable com- 
binations : 

Just as; in the species, the truism that the 
dead cannot breed implies that there is a 
fundamental tendency for the successful to 
replace the unsuccessful, so in the nervous 
system does the truism that the unstable 
tends to destroy itself imply that there is 
a fundamental tendency for the stable to 
replace the unstable (p. vi). 


Ernst Mach was another great 19th 
century thinker about thinking who 
emphasized this model. We _ today 
remember him most as a psychologist- 
physicist-philosopher who contributed 
to the present day positivistic recogni- 
tion of the hypothetic character of our 
constructions of the world and who 
first made explicit the empirical pre- 
sumptions involved in the physicist’s 
assumption of an Euclidian space. But 
when, at the age of 57 in 1895, he 
was called back to his alma mater the 
University of Vienna to assume a 
newly created position of Professor of 
the History and Theory of Inductive 
Science, he chose a quite different 
theme for his inaugural address. His 
title was “on the part played by ac- 
cident in invention and discovery.” 
The occasion indicates the importance 
he gave to the message, and indeed, 

8 Souriau’s presentation is in general quite 
modern in spirit, although associationistic 
in a way some would find dated, and 
vigorously deterministic in a way now under- 
mined by subatomic physics, although not 
necessarily so for the problems of which 
he treats. He comments wisely on many 
topics not covered here, including simulta- 
neous independent invention and the Zeit- 
geist, the social conditions of creativity and 
invention, the dissonance created by dis- 
‘repant opinions of others, the congruence 
of free will and determinism, and both the 
conflict and interdependency between erudi 
tion and innovation. His attacks on both 
deduction and induction are reminiscent of 
Peirce’s later critiques 





BLIND VARIATION AND SELECTIVE RETENTION IN CREATIVE THOUGHT 


his paper is a neglected classic in the 
psychology of knowledge 
These quotations (Mach, 1896) fur- 
ther reinforce the 
thought being presented : 


processes. 


model of creative 


The disclosure of new provinces of facts 
before unknown, can only be brought about 
by accidental circumstances .. . (p. 168). 

In such [other] cases it is a psychical 
accident to which the person owes his dis- 
govery—a discovery which is here made 
“deductively” by means of mental odbles 
of the world, instead of 

171). 

After the repeated survey of a field has 
afforded opportunity for the interposition 
of advantageous accidents, has rendered all 
the traits that suit with the word or the 
dominant thought more vivid, and has 
gradually relegated to the background all 
things that are inappropriate, making their 
future appearance impossible; then from the 
host of fancies which a 
free and high-flown imagination calls forth 
suddenly that particular form 
the light which harmonizes perfectly with 
the ruling idea, mood, or design. Then it 
is that that which has resulted slowly as 
the result of a gradual selection, appears 
as if it outcome of a deliberate 
act of creation. Thus are to be explained 
the statements of Newton, Mozart, Richard 
Wagner, and when that 
thoughts, melodies, and harmonies had 
poured in upon that they had 
simply retained the right ones (p. 174 


experimentally 


teeming, swelling 


arises to 


1 
were the 


others, they say 


them, and 


Poincaré (1908, 1913) in his famous 
essay on mathematical invention pre- 
sents a point of view which is also 
judged to be in agreement. He first 
gives “One 
custom, I 


an example in imagery: 
evening, contrary to my 


drank black coffee and could not sleep 
Ideas 


them 
collide until pairs interlocked, so to 
speak, making a stable combination” 
(Poincaré, 1913, p. 387 
that it is rare for 
permuting 


rose in crowds: [| felt 


Poincaré 
this blind 
process to rise into con 
and that as a 
only the successful selected alternatives 


feels 


scious awareness, rule 


enter consciousness. Zecause of the 


relevance of Poincaré’s comments; be- 
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cause Hadamard (1945) has cited him 
in opposition to the accidentalist posi- 
tion while he is read here as favoring 
the selective-retention version of it; 
and because of all of the sources cited 
he would most generally be respected 
as truly creative (in the field of math- 
ematics) these longish excerpts (Poin- 
caré, 1913) are read into the record: 


It is certain that the combinations which 
present themselves to the mind in a sort 
of sudden illumination, after an unconscious 
working somewhat prolonged, are generally 
useful and fertile combinations, which seem 
the result of a first impression. Does it 
follow that the subliminal self, having 
divined delicate intuition that these 
combinations would be useful, has- formed 
only these, or has it rather formed many 
which were lacking in interest and 
have remained unconscious ? 

In this way of looking at it, all the 
combinations would be formed in con- 
sequence of the automatism of the sub- 
liminal self, but only the interesting ones 
would break into the domain of conscious- 
And still very mysterious. 
What is the cause that, among the thousand 
products of our unconscious activity, some 
are called to pass the threshold, while 
others remain below? Is it a simple chance 
which confers this privilege? Evidently 
not; among all the stimuli of our senses, 
for example, only the most intense fix our 
attention, unless it has been drawn to them 
More generally the priv- 
ileged unconscious phenomena, those sus- 
ceptible of becoming conscious, are those 
directly or indirectly, affect 
most profoundly our emotional sensibility 
(p. 391). 

we reach the following conclusion: 
The useful combinations are precisely the 
most beautiful, I mean those best able to 
charm this special sensibility that all mathe- 
maticians but of which the profane 
are so ignorant as often to be tempted to 
smile at it 

What Among the great 
numbers of combinations blindly formed by 
the subliminal self, almost all are without 
without utility; but just for 
are also without effect 
sensibility. Consciousness 
will never know them; only certain ones are 
harmonious, and, consequently, at once use- 
ful and beautiful. They will be capable of 


by a 


others 


ness. this is 


by other causes. 


which, 


know, 


happens then? 


interest and 
that 
upon the esthetic 


reason they 
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touching this special sensibility of the 
geometer of which I have just spoken, and 
which, once aroused, will cali our attention 
to them, and thus give them occasion to 
become conscious. 

This is only a hypothesis, and yet here 
is an observation which may confirm it: 
when a sudden illumination seizes upon the 
mind of the mathematician, it usually hap- 
pens that it does not deceive him, but it 
also sometimes happens, as I have said, 
that it does not stand the test of verification ; 
vell, we almost always notice that this 
false idea, had -it been true, would have 
gratified our natural feeling for mathe- 
matical elegance. 

Thus it is this special esthetic sensibility 
which plays the role of the delicate sieve 
of which I spoke, and that sufficiently 
explains why the one lacking it will never 
be a real creator. 

Yet all the difficulties have not disap- 
peared. The conscious self is narrowly 
limited, and as for the subliminal self we 
know not its limitations, and this is why 
we are not too reluctant in supposing that 
it has been able in a short time to make 
more different combinations than the whole 
life of a conscious being could encompass. 
Yet these limitations exist. Is it likely 
that it is able to form all the possible com- 
binations, whose number would frighten the 
imagination? Nevertheless that would seem 
necessary, because if it produces only a small 
part of these combinations, and if it makes 
them at random, there would be small chance 
that the good, the one we should choose, 
would be found among them. 

Perhaps we ought to seek the explanation 
in that preliminary period of conscious work 
which always precedes all fruitful uncon- 
scious labor. Permit me a rough compari- 
son. Figure the future elements of our 
combinations as something like the hooked 
atoms of Epicurus. During the complete 
repose of the mind, these atoms are motion- 
less, they are, so to speak, hooked to the 
wall; so this complete rest may be indefi- 
nitely prolonged without the atoms meeting, 
and consequently without any combination 
between them. 

On the other hand, during a period of 
apparent rest and unconscious work, certain 
of them are detached from the wall and put 
in motion. They flash in every direction 
through the space (I was about say the 
room) where they are enclosed, as would, 
for example, a swarm of gnats or, if you 
prefer a more learned comparison, like the 
molecules of gas in the kinematic theory of 
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gases. Then their mutual impacts may pro- 
duce new combinations. 

What is the role of the preliminary con- 
scious work? It is evidently to mobilize 
certain of these atoms, to unhook tnem from 
the wall and put them in swing. We think 
we have done no good, because we have 
moved these elements a thousand different 
ways in seeking to assemble them, and have 
found no satisfactory aggregate. But, after 
this shaking up imposed upon them by our 
will, these atoms do not return to their 
primitive rest. 
dance. 

Now, our will did not choose them at 
random; it pursued a perfectly determined 
aim. The mobilized atoms are therefore not 
any atoms whatsoever; they are those from 
which we might reasonably expect the de- 
sired solution. Then the mobilized atoms 
undergo impacts which make them enter into 
combinations among themselves or with 
other atoms at rest which they struck against 
in their course. Again I beg pardon, my 
comparison is very rough, but I scarcely 
know how otherwise to make my thought 
understood 

However it may be, the only combinations 
that have a chance of forming are those 
where at least one of the elements is one of 
those atoms freely chosen by our will. Now, 
it is evidently among these that is found 
what I call the good combination. Perhaps 
this is a way of lessening the paradoxical 
in the original hypothesis. 

Another observation. It never happens 
that the unconscious work gives us the result 
of a somewhat long calculation all made, 
where we have only to apply fixed rules. 
We might think the wholly automatic sub- 
liminal self particularly apt for this sort of 
work, which is in a way exclusively mechan- 
ical. It seems that thinking in the evening 
upon the factors of a multiplication we might 
hope to find the product ready made upon 
our awakening, or again that an algebraic 
calculation, for example a_ verification, 
would be made unconsciously. Nothing of 
the sort, as observation proves. All one 
may hope from these inspirations, fruits of 
unconscious work, is a point of departure 
for such calculations. As for the calcula- 
tions themselves, they must be made in the 
second period of conscious work, that which 
follows the inspiration, that in which one 
verifies the results of this inspiration and 
deduces their consequences. The rules of 
these calculations are strict and complicated 
They require discipline, attention, will, and 
therefore consciousness. In the subliminal 


They freely continue their 
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elf, on the contrary, reigns what I should 
call liberty, if we might give this name to 
the simple absence of discipline and to the 
disorder born of chance. Only, this dis- 
order itself permits unexpected combinations 
(pp. 392-394). 

In addition to these pioneers there 
of course have been numerous others 
who in some manner have made a sub- 
stitute trial and error in a modeled or 
mnemonic environment an important 
aspect of their description of thinking. 
In rough chronology these include 
Baldwin (1906), Pillsbury (1910), 
Rignano (1923), Woodworth (1921), 
Woodworth & Schlosberg (1954), 
Thurstone (1924), Tolman (1926), 
Hull (1930), Muenzinger (1938), Mil- 
ler and Dollard (1941), Craik (1943), 
Boring (1950), Humphrey (1951), 
Mowrer (1954), Sluckin (1954), and 
many others. 


OBJECTIONS TO THE MODEL 
The Gestalt Protest 


The trial-and-error theme in learn- 
ing was of course one part of the syn- 
drome of ideas against which Gestalt 
psychology eloquently protested. In 
spite of this, there is judged to be no 
inherent conflict between the perspec- 
tives of this paper and the Gestalt posi- 
tion. To make this interpretation, it is 
necessary to regard neither traditional 
associationism. nor Gestalt psychology 
as discrete integrated wholes, but in- 
stead to regard each as congeries of 


which the parts may be separately ac- 


cepted or rejected. 

The Gestaltists are judged to have 
validly rejected Thorndike’s (1898) de- 
scription of animal problem solving as 
solely a matter of overt locomotor trial 
and error. As this writer (Campbell, 
1956b) has argued previously, recog- 
nizing vision as a substitute trial-and- 
error process leads to the expectation 
that some locomotor problems will be 
solved by this means, obviating overt 


trial,and error. Even more so does 
the model for thought. To the present 
writer the Gestaltists were correct de- 
scriptively even though epistemologi- 
cally the trial-and-error process re- 
mains fundamental to discovery. In 
Wertheimer’s (1959) specific con- 
trasts between insightful problem solv- 
ing and blind trial and error, it is a 
trial and error of overt manipulation 
which is involved. Furthermore, as 
Humphrey (1951) and Woodworth 
and Schlosberg (1954) point out, the 
Gestalt descriptions of problem solving 
provide ample evidence of both for- 
tuitous solutions and _ misleading 
“insights.” The recurrent Gestaltist 
protest that even the errorful trials are 
“intelligent” and that the subsequent 
trials make use of what was learned 
through the error are taken here as 
equivalent to the statements that the 
problem solver had some valid general 
knowledge to begin with, and that be- 
fore acting he employed the substitute 
trial and error of thought or vision. 
The blind-variation-and-selective-re- 
tention model of thought joins the 
Gestaltists in protest against the pic- 
ture of the learning organisms as a 
passive induction machine accumulat- 
ing contingencies. Instead, an active 
generation and checking of thought- 
trials, hypotheses, or molar responses 
is envisaged. The model at the level 
of thought places essential importance 
upon internalized selective criteria 
against which the thought trials are 
checked. Poincaré’s (1913) esthetic 
criteria and the Gestalt qualities of 
wholeness, symmetry, organized struc- 
ure, and the like can be regarded as 
built-in selective criteria completely 
compatible with the model. Pringle 
(1951) for example, has proposed a 
selective-retention model of central 
nervous system action in which sys- 
tematic temporal patterns provide the 
selective criteria in a resonance process. 





390 


Nor does the model here’ presented 
specify the nature of the thought trials 
employed. There must often be a trial 
and error among general 
principles, or among rational abstrac- 
tions, or field reorganizations, or re- 
centerings. Both the blind variation 
and selective survival model, and 
Gestalt theory emphasize the advantage 
of breaking out of old ruts, and the 
disadvantages of set and rote drill 
(Boring, 1950; Dunker, 1945; Katona, 
1940; Luchins, 1942; Wertheimer, 
1959). Furthermore, the encounter- 
ing in thought of an idea which fits can 
be accompanied at the phenomenal level 
by a joyful “ahaerlebnis” or a Gestalt 
experience of “closure,” and at the 
overt performance level by a sudden 
and stable improvement signifying “in- 
sight.” There is no essential disagree- 
ment here. Nor is the trial-and-error 
model without phenomenological sup- 
port. Note the highly similar testimony 
from the disparate historical citations 
provided above, especially in the im- 
agery of multitudinous, loosened, agi- 
tated, teeming, colliding, and interlock- 
ing ideas. ; 

This is not to say that a Gestalt 
psychologist would be happy with the 
blind - variation -and - selective- retention 
description of thought processes. Nor 
are all aspects of the Gestalt syndrome 
here accepted. While “insight” is ac- 
cepted as a phenomenal counterpart of 
the successful completion of a perhaps 
unconscious blind-variation cycle, its 
status as an explanatory concept is re- 
jected, especially as it connotes “direct” 
ways of knowing. Furtherniore, when 
publicized as a part of an ideology of 
creativity, it can reduce creativity 
through giving students a feeling that 
they lack an important gift possessed 
by some others, a feeling which inhibits 
creative effort and increases depend- 
ence upon authority. Polya (1945, 
1954) has described such an inhibiting 


possible 
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tradition in the teaching of mathe 


matics. 
Individual Differences and Genius 
Another prevalent orientation anti- 
thetical in spirit to the blind-variation- 
and-selective-retention model may be 
called the “mystique of the creative 
genius and the creative act.” This is 
related to our deeply rooted tendency 
toward causal perception (e.g., Heider, 
1944), a tendency to see marvelous 
achievements rooted in equally mar- 
velous antecedents. It takes the form 
of the “fallacy of accident” and of 
“post hoc ergo propter hoc.’ Let a 
dozen equally brilliant men each pro- 
pose differing guesses about the un- 
known in an area of total ignorance, 
and let the guess of one man prove 
correct. From the blind-variation-and- 
selective-survival model this matching 
of guess and environment would pro- 
vide us with new knowledge about the 
environment but would tell us nothing 
about the greater genius of the one 
he just happened to be standing 
where lightning struck. In such a case, 
however, we would ordinarily be 
tempted to look for a subtle and special 
talent on the part of this lucky man. 
However, for the genuinely unantici- 
patable creative act, our “awe” and 
“wonder” should be directed outward, 
at the external world thus revealed, 
rather than directed toward the ante- 
cedents of the discovery. Just as we 
do not impute special “foresight” to a 
successful mutant allele over an un- 
successful one, so in many cases of 
discovery, we should not expect mar- 
velous consequents to have had equally 
marvelous antecedents. Similarly, in 
comparing the problem-solving efforts 
of any one person; from the selective 
survival model it will be futile, in the 
instance of a genuinely innovative 
achievement, to look for special ante- 
cedent conditions not obtaining for 


man 
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blind-alley efforts: just insofar as there 
has been a genuine gain in knowledge, 
the difference hetween a hit 
miss lies in the selective conditions 
thus newly encountered, not in talent 
differences in the generation of the 
trials. 


and a 


This is not to deny individual differ- 
ences in creative intellect. Indeed, 
the blind-variation-and-selective-reten- 
tion model of creative thought predicts 
such talent differences along all of the 
parameters of the process. This is to 
emphasize, however, that explanations 
in terms of special antecedents will very 
often be irrelevant, and that the causal- 
interpretative biases of our minds make 
us prone to such over-interpretations, 
to post-hoc-ergo-proper-hoc interpreta- 
tions, deifying the creative genius to 
whom we impute a capacity for direct 
insight instead of mental flounderings 
and blind-alley entrances of the kind 
we are aware typify our own thought 
Ernst Mach (1896) notes 
our nostalgia for the directly-knowing 
genius : learn 
born 
science in the 


. 
pre cesses. 


“To our humiliation we 
that even the greatest men are 
than for 
extent to which even they are indebted 


more for life 


to accident” (p. 175). 

What are the ways in which thinkers 
might be expected to differ, according 
to the trial-and-error model? First, 
they may differ in the accuracy and 
detail of their representations of the 
external world, of possible locomotions 
in it or manipulations of its elements, 
and of the Differ- 
ences in this accuracy of representa- 
tion correspond to differences in de- 
gree of information and _ intelligence. 
Second, thinkers can differ in the 
number and range of variations in 
thought trials produced. The more 
and the more varied such 
trials, the greater the chance of success. 
ain has emphasized the role of fanati- 
cism or extreme dedication in pro- 


selective criteria. 


numerous 
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ducing large volumes of such explora- 
tions. Bain, Souriau, Mach, and 
Poincaré have all emphasized the role 
of advance preparation in assembling 
the elements whose blind permutation 
and combination make possible a wide 
range of trials. Many observers have 
emphasized the role of set and famili- 
arity in reducing the range of varia- 
tions, and have recommended ways of 
reducing trial-to-trial stereotypy, as by 
abandoning the problem for awhile, 
going on to other things. Devices 
abound which are designed to increase 
the likelihood that all permutations be 
considered and are used by most of 
us, as in going through the alphabet 
in finding rhymes or puzzle words. 
There are no doubt age differences in 
the rapidity and uninhibited range of 
thought-trial production. The sociology 
of knowledge makes an important con- 
tribution here: persons who have been 
uprooted from traditional cultures, or 
who have been thoroughly exposed to 
two or more cultures, seem to have the 
advantage in the range of hypotheses 
they are apt to consider, and through 
this means, in the frequency of creative 
Thorstein Veblen (1919) 
has espoused such a theory in his essay 
on the intellectual preeminence of Jews, 
as has Robert Park (1928) in writing 
of the role of “the marginal man” in 
cultural innovation. (See also Seeman, 
1956.) And more generally, it is the 
principle of variation which leads us 
to expect among innovators those of 
personal eccentricity and bizarre be- 
havior. We can also see in this prin- 
ciple the value of those laboratories 
whose social atmospheres allow wide 
ranging exploration with great toler- 
ance for blind alley entrances. 

The value of wide ranging variation 
in thought trials is of course vitiated if 
there is not the precise application of 
a selective criterion which weeds out 
the overwhelming bulk of inadequate 


innovation. 
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trials. This editing talent undoubtedly 
differs widely from person to person, 
as Poincare (1908, 1913) has empha- 
sized. With regard to selection cri- 
teria, one further point should be 
made. Much of creative thought is 
opportunistic in the sense of having a 
wide number of selective criteria avail- 
able at all times, against which the 
thought trials are judged. The more 
creative thinker may be able to keep in 
mind more such criteria, and therefore 
increase his likelihood of achieving a 
serendipitous (Cannon, 1945; Merton, 
1949) advance on a problem tangential 
to his initial main line of endeavor 
(e.g., Barber & Fox, 1958). Further 
areas of individual differences lie in 
the competence of the retention, cumu- 
lation, and transmisison of the encoun- 
tered solutions. 

It need not be expected that these 
dimensions of talent all go together. 
In organic evolution, the variation 
process of mutation and the preserva- 
tion of gains through genetic rigidity 
are at odds, with an increase in either 
being at the expense of the other, and 
with some degree of compromise being 
optimum. Just so we might expect 
that a very pure measure of innovative 
range in thought and a very pure 
measure of rote memory might be even 
negatively correlated, as Saugstad 
(1952) seems to have found, and 
similarly for innovative range and 
selective precision. Such considera- 
tions suggest complementary combina- 
tions of talent in creative teams, al- 
though the uninhibited idea-man and 
the compulsive edit-and-record type 
are notoriously incompatible office 
mates. 

Notice regarding the individual 
differences thus described that while 
they do make creative innovation much 
more likely on the part of some in- 
dividuals than others, they do not place 
the joys of creative innovation beyond 
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the reach of the less gifted. Indeed, 
looking at large populations of think- 
ers, the principles make it likely that 
many important contributions will 
come from the relatively untalented, 
undiligent, and uneducated, even 
though on an average contribution per 
capita basis, they will contribute much 
less, points which Souriau (1881) has 
noted. The intricacy of the tradition 


to which innovation is being added of 
course places limitations in this regard. 


The Enormous Domain of Possible 
Thought-Trials to be Searched 


A final typé of objection to the 
blind - variation ~-and - selective- retention 
model of thought needs to be con- 
sidered. This objection is to the ef- 
fect that the domain of possible 
thought trials is so large that the solu- 
tion of a given problem would take an 
impossibly long time were a search of 
all possibilities to be involved, either 
through a systematic scanning of all 
possibilities where these are enumer- 
able, or through a random sampling of 
the universe of possibilities. Time and 
trial estimates thus based can be over- 
whelming, as Kurt Lasswitz’s story 
“The Universal Library” (1958) dra- 
matically illustrates. Other parodies 
of our model occur in literature as 
far back as Swift’s portrait of the 
Academy of Lagado in Gulliver's 
Travels (1941, pp. 166-169). (Ley, 
1958, traces such ideas back to Lully, 
ca, 1200.) Newell, Shaw, and Simon 
(1958a, 1958b) refer in this vein to 
what they call the “British Museum 
Algorithm,” i.e., the possibility of a 
group of trained chimpanzees typing 
at random producing by chance in 
the course of a million years all of 
the books in the British Museum. 
Such parodies seem effectively to re- 
ject the blind-variation-and-selective- 
retention model through a reductio ad 
absurdum. Needless to say, such a 
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rejection is not accepted in the present 
paper. Asa matter of fact, it is judged 
to be in the.same class as parallel 
objections to the theory of natural 
selection in evolution. Similar features 
in these two instances make the acci- 
dentalist interpretation more accept- 
able. 

1. Neither in organic evolution nor 
in thought are all problems solved, 
nor all possible excellent solutions 
achieved. There is no guarantee of 
omniscience. The knowledge we do 
encounter is achieved against terrific 
odds. (Those advocating heuristi- 
cally-programed problem-solving com- 
puters are careful not to guarantee 
solutions, and this modesty should be 
extended to all creative 
thought. ) 

2. The tremendous number of non- 
productive thought trials on the part 
of the total intellectual community 
must not be underestimated. : Think of 
what a_ small proportion of thought 
becomes conscious, and of conscious 
thought what a small proportion gets 
uttered, what a still smaller fragment 
gets published, and what a small pro- 
portion of what is published is used by 
the next intellectual generation. There 
is a tremendous wastefulness, slowness, 
and rarity of achievement. 

3. In :biological evolution and in 
thought, the number of variations ex- 
plored is greatly reduced by having 
selective criteria imposed at every step. 
Thus mutant variations on nonadaptive 
variations of the previous generation 
are never tested—even though many 
wonderful combinations may be missed 
therefore. Some of the “heuristics” 
currently employed in logic and chess 
playing machines (Newell, Shaw, & 
Simon, 1958a, 1958b) have the similar 
effect of evaluating all next-possible 
moves in terms of immediate criteria, 
and then of exploring further varia 


models of 


tions upon only those stems passing 


the screening of each prior stage. It 
is this strategy of cumulating selected 
outcomes from a blind variation, and 
then exploring further blind variations 
only for this highly select stem, that, 
as R. A. Fisher has pointed out (e.g., 
1954, p. 91) makes the improbable in- 
evitable in organic evolution. This 
strategy is unavoidable for organic 
evolution, but can obviously be relaxed 
in thought processes and in machine 
problem solving. However, the Pan- 
dora’s box of permutations opened up 
by such relaxation can be used to infer 
that, in general, thought trials are 
selected or rejected within one or two 
removes of the established base from 
which they start. In constructing our 
“universal library” we stop work on 
any volume as soon as it is clear that 
it is gibberish. 

4. When we make estimates of the 
number of permutations which would 
have to be culled to obtain a given 
outcome, we often assume that problem 
solving was undertaken with that one 
fixed goal in mind. This overlooks 
the opportunistic, serendipitous course 
of organic evolution and of much of 
creative thinking. The likelihood of 
a productive thought increases with 
the wider variety of reasons one 
has for judging a given outcome 
“interesting.” To neglect this oppor- 
tunistic multipurposedness gives one 
a poor base for estimating the proba- 
bility of encountering the one outcome 
hit upon and recorded. Thus when 
Newell, Shaw, and Simon’s “Logic 
Theorist” (1958a, 1958b) sets out to 
prove the 60-odd theorems in a given 
chapter of Principia Mathematica, it 
may face a more formidable task than 
did Whitehead and Russell in generat- 
ing them, if, except for the dozen 
classic theorems reproduced, White- 
head and Russell were otherwise free 
to record every deduction they en- 
countered which seemed “interesting” 
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or “nontrivial.” Wigglesworth (1955) 
has noted this strategy on the part of 
“pure” scientists, in commenting on the 
relationship between pure and applied 
scientists in wartime : 


In the pure science to which they were ac- 
customed, if they were unable to solve prob- 
lem A they could turn to problem B, and 
while studying this with perhaps small pros- 
pect of success they might suddenly come 
across a clue to the solution of problem C 
(p. 34). 


In presenting their case for adding 
“heuristics” to the program of the 
“Logic Theorist,” Newell, Shaw, and 
Simon have emphasized the inadequacy 
of blind trial and error. So has 
Miller (1959) in advocating the 
heuristic of searching backward from 
the goal.t There is, however, no es- 


* Miller (1959, pp. 244-246) is wrong in 
implying that the strategy of working back- 
ward from the goal eliminates the necessity 
of symbolic trial and error in creative 
thinking. His mistake comes from assuming 
that only one path leads into a goal 
goal. In the spatial locomotion problems 
from which his concrete illustrations 
and for the logic problems used by Newell, 
Shaw, and Simon, the paths into any position 
are not singular, but are instead typically 
as numerous as paths leading ovt. A pure 
strategy of working from the goal back to 
the start would thus involve exploring just 
as many permutations as the pure strategy 
of exploring all paths from the start posi 
tion. However, there is a strategy 
available to symbolic trial and error and not 
to overt trial and error, in working con- 
currently from both ends. This produces a 
dramatic advantage in the number of permu 
tations generated, and a smaller but still 
substantial gain in the number of compari- 
sons. In the instance suggested by Miller in 
which each branches into 4 alterna- 
tives and in which the start and goal turn 
out to lie 7 stages apart, either of the pure 
strategies would generate 4+ 42+4 
44++45+46+ 47 or 21,844 permutations 
(neglecting the probable achievement of 
success before exhausting the 47 generation 
of alternatives). For the mixed approach, 
the junction would be encountered at the 
third stage away from each end, or 
2(4+ 42+ 4%) or 168 alternatives had been 
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come, 


useful 
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sential disagreement between their 
point of view and the one offered 
here. By adding heuristics mechanical 
thought processes have indeed been 
made more like those of human beings, 
both in adequacy and type of errors. 
Such innovations have obviated the 
protests of those such as Wisdom 
(1952) and Mays (1956) who, while 
conceding that machines could choose 
good moves at chess or solve logic 
problems, have found the machines 
failing to imitate life just in their 
orderly inspection of all possibilities. 
Newell, Shaw, and Simon recognize 
that a machine which would develop its 
own heuristics would have to do so by 
a trial and error of heuristic principles, 
with no guarantee that any would 
work. They further recognize that 
possession of an effective heuristic 
represents already achieved general 
knowledge about the domain under 
search, and that adding to this gen- 
eral knowledge will be a blind search 
(The devices of learning 
and and of coding environ- 
mental possibilities for thought-search 


prt cess. 


visior 


there 


examined In the 
would be 21,845 comparisons involved, that 
is, the start position and each subsequent 


pure strategy, 


alternative would be compared with the 
In the mixed strategy, many more 
comparisons per alternative are required 
Each permutation must be compared with 
each of the current and previous permuta 
tions on the other stem, which in this in- 
stance 7,225 comparisons. If 
the comparisons are regarded as equally 
stly as the generating and storing of alter 
natives, the savings of the mixed approach 
over either of the pure approaches would 
amount to approximately 1 to 6, a very 
handsome gain for any heuristic. This gain 
is larger as the number of branches at each 
stage increases. Advocacy of the heuristic 
of working backwards in what is essentially 
this mixed form is present in Polya (1945), 
Wisdom (1952), and Newell, Shaw, and 
Simon (1958a, 1958b). In none of these is it 
claimed that trial and error is eliminated, 
while all point to the reduction in trials 
which it can achieve 


goal. 


amounts to 
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all represent heuristics in this sense.) 
They might agree that most 
heuristic devices will be limited to the 
specific domain of their discovery, and 
can only be extended to other domains 
on a trial basis. They would probably 
that no problem solving 
be “direct.” The 
with their 


also 


also agree 
will 
agreements | 
cellent paper on the processes of cre- 
ative thinking (Newell, Shaw, & 
Simon, 1958b) are thus minor matters 
of emphasis, but may be worth stating 
nonetheless, to further clarify the 
position here advocated. They say, for 


process dis- 


have ex 


example : 


\ 
sizes 


We'have given enough estimates of the 

r the . to cast suspicion 
upon a theory of creativity which places its 
emphasis in trial and 
63). 


spaces involved 


upon increase error 


ds 
) 


The blind-variation-and-selective-reten 
tion model unequivocally implies that 
ceteris paribus, the greater the heter- 
and volume of trials the 
greater the chance of a productive in- 
novation. Doubling the number of 
efforts very nearly doubles the chance 
of a hit, particularly when trials are a 
small part of the total domain and the 
repetitiousness among trials is low. 
But they too recognize unconvention- 
ality and no doubt numerosity as a 
necessary, if not a sufficient condition 
of creativity (1958b, p. 62). What 
they would validly stress, is the very 
frequent tactical advantage of a trial 
and error of general strategic prin- 


ogeneity 


ciples over a trial and error involving 
no classificatory effort nor attempt to 
use and, once such general 
heuristics have been the 
advantages of a hierarchized trial and 


clues, 
discovered, 
error process. The advantage of such 
a strategy depends upon the ecology, 
of course, but we are in general justi- 
fied in expecting solutions to be non- 
and to show 


randomly distributed, 
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contingencies with prior 
Polya (1945, 1954) of 
course, heen a major source of in- 
spiration for all efforts to introduce 
heuristics into problem solving, and 
for him a trial and error approach is 


a heuristic of fundamental importance. 


significant 


clues. has, 


Another minor point of disagree- 
mentioned. In_ their 
efforts to consider how a “Logic 
Theorist” might be programed to 
learn a general heuristic from hind- 
sight they propose that it keep a 
record the outcomés of all past 
trials, successful and unsuccessful, in 
order to be able to scan its experience 
for general principles of strategy 
(1958b). Implementing this would 
put a tremendous strain upon memory 
storage, and would introduce a scan- 
ning process as time consuming as the 
original search process which pro- 
duced the record. The strategy of 


ment may be 


of 


organic evolution is to keep a record 
only of what works, even at the ex- 
The gen- 


pense of repeating its errors. 
eral preponderance of wrong tries at 
every level, plus problems of memory 
glut and access, suggests a similar 
strategy for all knowledge processes. 
Heuristics can probably best be learned 
through a trial and error of heuristics, 
tried on new problem sets rather than 


old. 


STATUS AS A THEORY 


At the level here developed, one might 
better speak of an “orientation to,” or 
a “perspective on” creative thought 
processes, rather than a “theory of.” 
At many points, this perspective merely 
points to problems, rather than taking 
that step toward theory of providing 
guesses at answers: e.g., for the proc- 
here outlined to be possible, 
theory must not only provide several 
: processes, but most im- 

must specify a 
for the trial-and-error 


esses 


memory 
portantly, 
mechanism 


possible 
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search of these. . From such specifica- 
tions will come the subtlety of predic- 
tion characteristic of a developed 
theory.° While the perspective even 
in its very general state has some un- 
equivocal empirical implications, the 
major advantage to it may be meta- 
physical, or at least metatheoretical. 
Like the theory of natural selection in 
organic evolution, it provides 2n under- 
standing of: marvelously purposive 
processes without the introduction of 
teleological metaphysics or of pseudo- 
causal processes working backward in 
time. , 

Note that. there are still ambiguities 
about the status as theory of the well 
established principle of organic evolu- 
tion through natural selection, even 
though now buttressed with the de- 
tailed genetic model of the variation 
and retention processes. Scriven 


(1959) has called it an explanatory 
rather than a predictive theory. While 


the present writer feels that Scriven 
has somewhat undervalued the experi- 
mental studies of evolution with 
viruses, bacteria, and insects, and may 


5 This paper does not attempt to review 
theories in this area. Citations to the im- 
portant contributions of Pringle (1951) and 
Hebb (1949), do not begin to represent this 
literature. Note the special problem of a 
brain analogue to switching. Ashby’s 
(1952) model and most computer memory 
search involves a spatial displacement of 
solids impossible in the brain, as in the 
stepping switch or the rotation of a mag- 
netic memory drum. Computer memory 
search processes making use of timed pulses 
require a precision of timing dependent upon 
a stability of dimension presumably not 
available in the brain and usually if not 
always dependent upon a clock within which 
actual spatial displacement of solids takes 
place. It is for these reasons that Pringle’s 
(1951) theory seems particularly promising, 
and one woiiders why it has not been more 
used, or if not usable, more publicly refuted 
See Pribram (1959) for a recent contribu- 
tion to the problem of appropriate brain 
process models. 
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have mistaken the past absence of ex- 
perimental settings appropriate to test- 
ing the theory for an inherent attribute 
of the type of theory per se; he has 
called attention, nonetheless, to some 
serious problems. Even Sewall 
Wright (1960), whose statistical ge- 
netic theory of evolution has added 
subtle details to the overall description 
of the process, has commented in a 
similar vein: 


The theory is deterministic only in an ex- 
ceedingly limited sense. It is essentially a 
theory of the conditions favorable for an 
ever continuing process that is essentially un- 
predictable in its details. There can be no 
formula for serendipity (p. 148). 


The problems which a selective re- 
tention theory of creative thinking 
shares with that of evolution include 
the following : 


1. The basic insight, so useful and so 
thrilling when first encountered, is close to 
being an analytic tautology rather than a 
synthetic description of process: if indeed 
variations occur which are differentially 
selected and propagated, then an evolutionary 
process toward better fit to any set of con- 
sistent selective criteria is inevitable. 

2. For most applications of the selective 
retention model, the variation is taken as a 
descriptive given, as an unexplained part of 
the explanation. While other predictive 
theories likewise depend upon unexplained 
processes at a more molecular level, this 
instance may seem evasive at a particularly 
crucial point, and has indeed been taken as 
a denial of determinism or as a rival meta- 
physic of “spontaneous change” as presump- 
tive as a teleological one. We are currently 
getting detailed deterministic explanations 
of the mutation of genes, but until something 
comparable is available to predict the gen- 
eration of heterogeneous thought trials, this 
constitutes a weakness of the model. 

3. The biological study of the evolution of 
any species takes the form of a post hoc 
reconstruction of a unique, “undetermined,” 
historical process. The achievement of any 
general regularities must be probabilistic 
in the extreme. Studies starting from spe- 
cific spectacular achievements in creative 
thought must be similar in nature. 

4. The theory suffers from the multiplicity 
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of possible mediations it posits. Where 
there are gaps in the historical knowledge, 
the theory makes available an embarrassing 
surfeit of possible reconstructions. In this, 
and in contrast to the successful theories 
of macrophysics, the theory is less self- 
disciplining, less specific in its predictions, 
more evasive of potential disconfirmation. 
This is perhaps the most important of 
Scriven’s (1959) points, and one equally ap- 
plicable to the theory of creative thinking. 

5. In the usual applications, the environ- 
ment is not described or describable prior to 
the organismic achievement of adaptation to 
that environment. Whereas the theory deals 
with an iterative process whereby an organ- 
ism adapts to (achieves knowledge of) an 
independent environment, the evidences as 
to the organismic form and environmental 
parameters are often confounded, in that the 
same data series is used to infer both. While 
this is not so for laboratory studies of trial- 
and-error learning, it is particularly apt to 
be so for any study of truly great creative 
thinking in science. (See Campbell, 1959, 
p. 157 for epistemological citations to this 
problem. ) 


There is in addition, a serious 
problem which the blind-variation-and- 
selective-retention theory of creative 
thought faces which is not present 
in comparable degree in the modern 
theory of organic evolution. This is 
the unfavorable ratio of hypothesized 
unobservable processes to observable 
input-output variables. Note that even 
in its sketchy form here given, some 
6 to 14 or more separately variable 
parameters are implied. These in- 
clude: (a) A mnemonic representation 
of environment, varying perhaps in 
scope, accuracy, and fineness of detail ; 
(b) A mnemonic search or thought- 
trial process, varying in the accuracy 
with which it represents potential overt 
exploration; (c) A thought-trial gen- 
erating and changing process, varying 
in rate, heterogeneity, idiosyncrasy, and 
lack of repetitiousness among succes- 
sive thought trials; (d) Selective crite- 
ria, varying in their number, accuracy 
of representation of environmental con- 
tingencies, and precision, sharpness, or 


selection ratio; (e¢) A preservation or 
propagation process, providing a reten- 
tion for selected thought trials of a 
quite different order from the memory 
traces of the nonselected ones, varying 
perhaps in accuracy and accessibility ; 
(f) A reality testing process in which 
the selected thought trials are checked 
out by overt locomotion in the external 
environment, varying perhaps in sensi- 
tivity to disconfirming feed-back. 

This inventory of weaknesses does not, 
of course, represent argument in favor 
of rival theories of creative thought, 
which are judged to be still more 
amorphous, still less adequate. And as 
Duncan (1959) makes clear in review- 
ing the research literature on human 
problem solving, for all theories there 
is lacking a disciplined relation both 
to experimental undertakings and to 
findings. Even in its present form, 
however, the theory contains many 
empirical implications. Manipulation 
of any one of the 14 variables just 
listed should increase the number of 
creative products, providing the other 
variables can be held constant. Pre- 
dictions of this order have been speci- 
fied in the discussion of individual dif- 
ferences. Particularly characteristic 
are the unequivocal predictions re- 
garding the volume and heterogeneity 
of thought trials. Ceteris paribus, a 
creative solution is more likely the 
longer a problem is worked upon, the 
more variable the thought trials, the 
more people working on the problem 
independently, the more heterogeneous 
these people, the less the time pres- 
sure, etc. 


SUMMARY 


This paper has attempted to make 
the psychological and epistemological 
point that all processes leading to ex- 
pansions of knowledge involve a blind- 


variation-and-selective-retention 
such as vision 


proc- 


ess. Processes, and 
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thought, substituting for an overt trial 
and error are of couse acknowledged. 
But each of these are interpreted as 
containing in its very workability wis- 


dom about the environment obtained 
originally by the blind variation of 
mutation and natural selection. In 


addition, each contains a blind-varia- 


tion-and-selective-retention process at 
Supporting the effort 
to interpret all knowledge processes 
in this light has been an emphasis 
upon the tremendous gain in knowl 
edge in the course of evolution and 
history, a gain which can only be ex 
plained by a continual breakout from 
the bounds of what already 
known, a breakout for which blind var 
provides the 
available. 


its own level 


was 


iation only mechanism 

The application of this perspective 
to the process of creative thought 
antedates its application to trial-and- 


error learning in animals and to or- 
ganic evolution, and is _ illustrated 


through quotations from Bain, Souriau, 
Mach, and Poincaré. The model is 
not in disagreement with the bulk of 
the Gestalt problem 
solving, nor the work on heuristically 
programed problem-solving computers 
effort to 
out a prevailing implicit belief in the 
possibility of “direct” or “insightful” 
creative thought processes 


description of 


However, there is an root 
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